{"id":55210,"date":"2026-04-03T11:50:07","date_gmt":"2026-04-03T11:50:07","guid":{"rendered":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-module-2\/lessons\/lesson-1-what-is-data-cleaning-3-2\/"},"modified":"2026-04-03T11:50:07","modified_gmt":"2026-04-03T11:50:07","slug":"lesson-1-what-is-data-cleaning-3-2","status":"publish","type":"lesson","link":"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/","title":{"rendered":"Lesson 1: What is data cleaning?"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" width=\"96\" height=\"114\" src=\"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png\" alt=\"\" srcset=\"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png 96w, https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1-10x12.png 10w, https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1-42x50.png 42w\" sizes=\"auto, (max-width: 96px) 100vw, 96px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/p>\n<h2>Lesson 1: Introduction to Data Analysis<\/h2>\n<h3>What is data cleaning?<br \/>\n<\/h3>\n<h5>Data cleaning, also known as data cleansing, is the process of identifying and correcting errors, inconsistencies, and inaccuracies in datasets.<\/h5>\n<h5>It is an essential step in data analysis and data processing, as raw data often contains errors and inconsistencies that can lead to incorrect or unreliable results if left unchecked.<\/h5>\n<h5>The process of data cleaning typically involves several steps, including removing duplicates, correcting misspellings and typos, handling missing or null values, standardizing data formats, and identifying and dealing with outliers or anomalies. Data cleaning may also involve verifying that data conforms to a set of predefined rules or constraints, such as data integrity constraints or business rules.<\/h5>\n<h5>The goal of data cleaning is to ensure that data is accurate, consistent, and reliable, so that it can be used effectively in analysis and decision-making. Data cleaning is often a time-consuming and labor-intensive process, but it is an important step in ensuring the quality and reliability of data.<\/h5>\n<h3>What is dirty data?<\/h3>\n<h5>Dirty data is essentially any data that needs to be manipulated or worked on in some way before it can be analysed.<\/h5>\n<h5>Some types of dirty data include:<\/h5>\n<details id=\"e-n-accordion-item-5440\" open>\n<summary data-accordion-index=\"1\" tabindex=\"0\" aria-expanded=\"true\" aria-controls=\"e-n-accordion-item-5440\" >\n\t\t\t\t\t  Incomplete data<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">For example, a spreadsheet with missing values that would be relevant for your analysis.<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">If you&#8217;re looking at the relationship between customer age and a number of monthly purchases, you&#8217;ll need data for both of these variables.<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">If some customer ages are missing, you&#8217;re dealing with incomplete data.<\/h5>\n<\/details>\n<details id=\"e-n-accordion-item-5441\" >\n<summary data-accordion-index=\"2\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-5441\" >\n\t\t\t\t\t Duplicate data<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5>For example,records that appear twice (or multiple times) throughout the same dataset. This can occur if you&#8217;re combining data from multiple sources or databases.<\/h5>\n<\/details>\n<details id=\"e-n-accordion-item-5442\" >\n<summary data-accordion-index=\"3\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-5442\" >\n\t\t\t\t\t Inconsistent or inaccurate data<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5>data that is outdated or contains structural errors such as typos, inconsistent capitalization, and irregular naming conventions.<\/h5>\n<h5>Say you have a dataset containing student test scores, with some categorized as &#8220;Pass&#8221; or &#8220;Fail&#8221; and others categorized as &#8220;P&#8221; or &#8220;F.&#8221;<\/h5>\n<h5>Both labels mean the same thing, but the naming convention is inconsistent, leaving the data rather messy.<\/h5>\n<\/details>\n<details id=\"e-n-accordion-item-5443\" >\n<summary data-accordion-index=\"4\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-5443\" >\n\t\t\t\t\t Misaligned data<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5>Misaligned data refers to the situation where data is placed in the wrong fields or columns in a dataset.<\/h5>\n<h5>For example, imagine a dataset that includes information about employees in a company, where the salary data is placed in the field intended for employee names.<\/h5>\n<h5>This type of error can be caused by various reasons, such as manual data entry errors, technical issues in data import or export, or formatting problems.<\/h5>\n<p>\u00a0<br \/>\n\t\t\t\t\t<\/details>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">For example, a spreadsheet with missing values that would be relevant for your analysis.<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">If you&#8217;re looking at the relationship between customer age and a number of monthly purchases, you&#8217;ll need data for both of these variables.<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">If some customer ages are missing, you&#8217;re dealing with incomplete data.<\/h5>\n<h5>For example,records that appear twice (or multiple times) throughout the same dataset. This can occur if you&#8217;re combining data from multiple sources or databases.<\/h5>\n<h5>data that is outdated or contains structural errors such as typos, inconsistent capitalization, and irregular naming conventions.<\/h5>\n<h5>Say you have a dataset containing student test scores, with some categorized as &#8220;Pass&#8221; or &#8220;Fail&#8221; and others categorized as &#8220;P&#8221; or &#8220;F.&#8221;<\/h5>\n<h5>Both labels mean the same thing, but the naming convention is inconsistent, leaving the data rather messy.<\/h5>\n<h5>Misaligned data refers to the situation where data is placed in the wrong fields or columns in a dataset.<\/h5>\n<h5>For example, imagine a dataset that includes information about employees in a company, where the salary data is placed in the field intended for employee names.<\/h5>\n<h5>This type of error can be caused by various reasons, such as manual data entry errors, technical issues in data import or export, or formatting problems.<\/h5>\n<p>\u00a0\t\t<\/p>\n<h3>What are some key steps in the data-cleaning process?<br \/>\n<\/h3>\n<h5>We&#8217;ve established how important the data-cleaning stage is.<\/h5>\n<h5>Now let&#8217;s introduce some data-cleaning techniques! To clean your data, you might do some or all of the following:<\/h5>\n<details id=\"e-n-accordion-item-2670\" open>\n<summary data-accordion-index=\"1\" tabindex=\"0\" aria-expanded=\"true\" aria-controls=\"e-n-accordion-item-2670\" >\n\t\t\t\t\t  Delete Unnecessary Columns<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">Chances are, your dataset will contain some values that aren&#8217;t relevant to your analysis<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">For example, in an analysis of students&#8217; test scores compared to hours spent studying, things like student ID number and date of birth aren&#8217;t relevant.<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">You could simply delete the columns containing this data.<\/h5>\n<p>\u00a0<br \/>\n\t\t\t\t\t<\/details>\n<details id=\"e-n-accordion-item-2671\" >\n<summary data-accordion-index=\"2\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2671\" >\n\t\t\t\t\t Identify and remove duplicates.<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5>\u00a0Duplicate data tends to occur during the data collection phase, so it&#8217;s important to filter them out.<\/h5>\n<\/details>\n<details id=\"e-n-accordion-item-2672\" >\n<summary data-accordion-index=\"3\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2672\" >\n\t\t\t\t\t Deal with missing data.<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5>In the case of missing data, you can either delete the entire entry associated with it (i.e. delete the whole row which contains the empty cell), impute the missing value based on other data, or flag all missing data as such by entering &#8220;0&#8221; or &#8220;missing&#8221; in the respective cell.\u00a0<\/h5>\n<h5>Each method for handling missing data has implications for your analysis, so you&#8217;ll need to choose your approach carefully.<\/h5>\n<\/details>\n<details id=\"e-n-accordion-item-2673\" >\n<summary data-accordion-index=\"4\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2673\" >\n\t\t\t\t\t Remove unwanted outliers.<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5>Outliers are values that differ significantly from other values in your data.<\/h5>\n<h5>For example, if you see that most student test scores fall between 50 and 80, but that one student has scored a 2, this might be considered an outlier.\u00a0<\/h5>\n<h5>Outliers may be the result of an error, but that&#8217;s not always the case, so approach with caution when deciding whether or not to remove them.<\/h5>\n<\/details>\n<details id=\"e-n-accordion-item-2674\" >\n<summary data-accordion-index=\"5\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2674\" >\n\t\t\t\t\t Fix inconsistencies.<br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t<svg aria-hidden=\"true\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><br \/>\n\t\t\t\t\t\t<\/summary>\n<h5>As already mentioned, inconsistencies in data include things like typos and irregular naming conventions.<\/h5>\n<h5>You can fix these manually (for example, using the &#8220;Find and replace&#8221; function in Google Sheets or Microsoft Excel to locate one spelling or convention and replace it with another) or by creating a filter.<\/h5>\n<p>\u00a0<br \/>\n\t\t\t\t\t<\/details>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">Chances are, your dataset will contain some values that aren&#8217;t relevant to your analysis<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">For example, in an analysis of students&#8217; test scores compared to hours spent studying, things like student ID number and date of birth aren&#8217;t relevant.<\/h5>\n<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\">You could simply delete the columns containing this data.<\/h5>\n<p>\u00a0<\/p>\n<h5>\u00a0Duplicate data tends to occur during the data collection phase, so it&#8217;s important to filter them out.<\/h5>\n<h5>In the case of missing data, you can either delete the entire entry associated with it (i.e. delete the whole row which contains the empty cell), impute the missing value based on other data, or flag all missing data as such by entering &#8220;0&#8221; or &#8220;missing&#8221; in the respective cell.\u00a0<\/h5>\n<h5>Each method for handling missing data has implications for your analysis, so you&#8217;ll need to choose your approach carefully.<\/h5>\n<h5>Outliers are values that differ significantly from other values in your data.<\/h5>\n<h5>For example, if you see that most student test scores fall between 50 and 80, but that one student has scored a 2, this might be considered an outlier.\u00a0<\/h5>\n<h5>Outliers may be the result of an error, but that&#8217;s not always the case, so approach with caution when deciding whether or not to remove them.<\/h5>\n<h5>As already mentioned, inconsistencies in data include things like typos and irregular naming conventions.<\/h5>\n<h5>You can fix these manually (for example, using the &#8220;Find and replace&#8221; function in Google Sheets or Microsoft Excel to locate one spelling or convention and replace it with another) or by creating a filter.<\/h5>\n<p>\u00a0<\/p>","protected":false},"comment_status":"open","ping_status":"closed","template":"","class_list":["post-55210","lesson","type-lesson","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Lesson 1: What is data cleaning? - Certifeka-edu<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/\" \/>\n<meta property=\"og:locale\" content=\"ar_AR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Lesson 1: What is data cleaning? - Certifeka-edu\" \/>\n<meta property=\"og:description\" content=\"Lesson 1: Introduction to Data Analysis What is data cleaning? Data cleaning, also known as data cleansing, is the process of identifying and correcting errors, inconsistencies, and inaccuracies in datasets. It is an essential step in data analysis and data processing, as raw data often contains errors and inconsistencies that can lead to incorrect or [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Certifeka-edu\" \/>\n<meta property=\"og:image\" content=\"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u0648\u0642\u062a \u0627\u0644\u0642\u0631\u0627\u0621\u0629 \u0627\u0644\u0645\u064f\u0642\u062f\u0651\u0631\" \/>\n\t<meta name=\"twitter:data1\" content=\"6 \u062f\u0642\u0627\u0626\u0642\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":[\"WebPage\",\"webpage\"],\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/\",\"name\":\"Lesson 1: What is data cleaning? - Certifeka-edu\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/logos-png-01-296x57-1.png\",\"datePublished\":\"2026-04-03T11:50:07+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/#breadcrumb\"},\"inLanguage\":\"ar\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ar\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/#primaryimage\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/logos-png-01-296x57-1.png\",\"contentUrl\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/logos-png-01-296x57-1.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-managers-module-ucam-university\\\/lessons\\\/lesson-1-what-is-data-cleaning-3-2\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"home\",\"item\":\"https:\\\/\\\/certifeka-edu.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Lessons\",\"item\":\"https:\\\/\\\/certifeka-edu.com\\\/lesson\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Lesson 1: What is data cleaning?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#website\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/\",\"name\":\"certifeka\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/certifeka-edu.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ar\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#organization\",\"name\":\"certifeka\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ar\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/cropped-certifeka-removebg-preview.png\",\"contentUrl\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/cropped-certifeka-removebg-preview.png\",\"width\":366,\"height\":104,\"caption\":\"certifeka\"},\"image\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Lesson 1: What is data cleaning? - Certifeka-edu","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/","og_locale":"ar_AR","og_type":"article","og_title":"Lesson 1: What is data cleaning? - Certifeka-edu","og_description":"Lesson 1: Introduction to Data Analysis What is data cleaning? Data cleaning, also known as data cleansing, is the process of identifying and correcting errors, inconsistencies, and inaccuracies in datasets. It is an essential step in data analysis and data processing, as raw data often contains errors and inconsistencies that can lead to incorrect or [&hellip;]","og_url":"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/","og_site_name":"Certifeka-edu","og_image":[{"url":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png","type":"","width":"","height":""}],"twitter_card":"summary_large_image","twitter_misc":{"\u0648\u0642\u062a \u0627\u0644\u0642\u0631\u0627\u0621\u0629 \u0627\u0644\u0645\u064f\u0642\u062f\u0651\u0631":"6 \u062f\u0642\u0627\u0626\u0642"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":["WebPage","webpage"],"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/","url":"https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/","name":"Lesson 1: What is data cleaning? - Certifeka-edu","isPartOf":{"@id":"https:\/\/certifeka-edu.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/#primaryimage"},"image":{"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/#primaryimage"},"thumbnailUrl":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png","datePublished":"2026-04-03T11:50:07+00:00","breadcrumb":{"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/#breadcrumb"},"inLanguage":"ar","potentialAction":[{"@type":"ReadAction","target":["https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/"]}]},{"@type":"ImageObject","inLanguage":"ar","@id":"https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/#primaryimage","url":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png","contentUrl":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png"},{"@type":"BreadcrumbList","@id":"https:\/\/certifeka-edu.com\/programs\/research-for-managers-module-ucam-university\/lessons\/lesson-1-what-is-data-cleaning-3-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"home","item":"https:\/\/certifeka-edu.com\/"},{"@type":"ListItem","position":2,"name":"Lessons","item":"https:\/\/certifeka-edu.com\/lesson\/"},{"@type":"ListItem","position":3,"name":"Lesson 1: What is data cleaning?"}]},{"@type":"WebSite","@id":"https:\/\/certifeka-edu.com\/#website","url":"https:\/\/certifeka-edu.com\/","name":"certifeka","description":"","publisher":{"@id":"https:\/\/certifeka-edu.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/certifeka-edu.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ar"},{"@type":"Organization","@id":"https:\/\/certifeka-edu.com\/#organization","name":"certifeka","url":"https:\/\/certifeka-edu.com\/","logo":{"@type":"ImageObject","inLanguage":"ar","@id":"https:\/\/certifeka-edu.com\/#\/schema\/logo\/image\/","url":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/03\/cropped-certifeka-removebg-preview.png","contentUrl":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/03\/cropped-certifeka-removebg-preview.png","width":366,"height":104,"caption":"certifeka"},"image":{"@id":"https:\/\/certifeka-edu.com\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/lesson\/55210","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/lesson"}],"about":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/types\/lesson"}],"replies":[{"embeddable":true,"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/comments?post=55210"}],"version-history":[{"count":0,"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/lesson\/55210\/revisions"}],"wp:attachment":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/media?parent=55210"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}