{"id":23670,"date":"2025-08-11T18:19:42","date_gmt":"2025-08-11T18:19:42","guid":{"rendered":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/"},"modified":"2025-08-25T18:36:27","modified_gmt":"2025-08-25T18:36:27","slug":"lesson-1-what-is-data-cleaning","status":"publish","type":"lesson","link":"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/","title":{"rendered":"Lesson 1: What is data cleaning?"},"content":{"rendered":"<div data-elementor-type=\"wp-post\" data-elementor-id=\"23670\" class=\"elementor elementor-23670\" wpc-filter-elementor-widget=\"1\">\n\t\t\t\t<div class=\"elementor-element elementor-element-2c4d08a e-con-full e-flex e-con e-parent\" data-id=\"2c4d08a\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div class=\"elementor-element elementor-element-62617d3 e-con-full e-flex e-con e-child\" data-id=\"62617d3\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-14a622b exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-image\" data-id=\"14a622b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"96\" height=\"114\" src=\"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png\" class=\"attachment-large size-large wp-image-16861\" alt=\"\" srcset=\"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png 96w, https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1-10x12.png 10w, https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1-42x50.png 42w\" sizes=\"(max-width: 96px) 100vw, 96px\" 
\/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-8915bff e-con-full e-flex e-con e-child\" data-id=\"8915bff\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-0224633 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-heading\" data-id=\"0224633\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Lesson 1: Introduction to Data Analysis<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-299bb5d e-flex e-con-boxed e-con e-parent\" data-id=\"299bb5d\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-3712d9b exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-heading\" data-id=\"3712d9b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">What is data cleaning?\n\n<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c24bdfb exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"c24bdfb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">Data cleaning, also known as data cleansing, is the process of identifying and correcting errors, inconsistencies, and inaccuracies in datasets.<\/span><\/h5><h5><span style=\"color: #000080;\">It is 
an essential step in data analysis and data processing, as raw data often contains errors and inconsistencies that can lead to incorrect or unreliable results if left unchecked.<\/span><\/h5><h5><span style=\"color: #000080;\">The process of data cleaning typically involves several steps, including removing duplicates, correcting misspellings and typos, handling missing or null values, standardizing data formats, and identifying and dealing with outliers or anomalies. Data cleaning may also involve verifying that data conforms to a set of predefined rules or constraints, such as data integrity constraints or business rules.<\/span><\/h5><h5><span style=\"color: #000080;\">The goal of data cleaning is to ensure that data is accurate, consistent, and reliable, so that it can be used effectively in analysis and decision-making. Data cleaning is often a time-consuming and labor-intensive process, but it is an important step in ensuring the quality and reliability of data.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-45eaa33 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-heading\" data-id=\"45eaa33\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">What is dirty data?<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fddfd96 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"fddfd96\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">Dirty data is essentially any data that needs to be manipulated or worked on in some way before it can be 
analysed.<\/span><\/h5><h5><span style=\"color: #000080;\">Some types of dirty data include:<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-e5c48b9 e-flex e-con-boxed e-con e-parent\" data-id=\"e5c48b9\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-33f321f exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-n-accordion\" data-id=\"33f321f\" data-element_type=\"widget\" data-e-type=\"widget\" data-settings=\"{&quot;default_state&quot;:&quot;expanded&quot;,&quot;max_items_expended&quot;:&quot;one&quot;,&quot;n_accordion_animation_duration&quot;:{&quot;unit&quot;:&quot;ms&quot;,&quot;size&quot;:400,&quot;sizes&quot;:[]}}\" data-widget_type=\"nested-accordion.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"e-n-accordion\" aria-label=\"Accordion. 
Open links with Enter or Space, close with Escape, and navigate with Arrow Keys\">\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-5440\" class=\"e-n-accordion-item\" open>\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"1\" tabindex=\"0\" aria-expanded=\"true\" aria-controls=\"e-n-accordion-item-5440\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\">  Incomplete data  <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5440\" class=\"elementor-element elementor-element-0d6e87b e-con-full e-flex e-con e-child\" data-id=\"0d6e87b\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5440\" class=\"elementor-element elementor-element-ff6ab18 e-con-full e-flex e-con e-child\" data-id=\"ff6ab18\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-5e1ad55 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"5e1ad55\" data-element_type=\"widget\" 
data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\"><span style=\"color: #011c7c;\">For example, a spreadsheet may be missing values that are relevant to your analysis.<\/span><\/h5><h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\"><span style=\"color: #011c7c;\">If you&#8217;re looking at the relationship between customer age and the number of monthly purchases, you&#8217;ll need data for both of these variables.<\/span><\/h5><h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\"><span style=\"color: #011c7c;\">If some customer ages are missing, you&#8217;re dealing with incomplete data.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-5441\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"2\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-5441\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Duplicate data  <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 
32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5441\" class=\"elementor-element elementor-element-2d65e11 e-con-full e-flex e-con e-child\" data-id=\"2d65e11\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5441\" class=\"elementor-element elementor-element-9fc6c3d e-con-full e-flex e-con e-child\" data-id=\"9fc6c3d\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-16da954 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"16da954\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">Duplicate data consists of records that appear more than once in the same dataset. 
This can occur if you&#8217;re combining data from multiple sources or databases.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-5442\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"3\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-5442\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Inconsistent or inaccurate data  <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5442\" class=\"elementor-element elementor-element-fe5f66e e-con-full e-flex e-con e-child\" data-id=\"fe5f66e\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5442\" class=\"elementor-element elementor-element-0a7207c e-con-full e-flex e-con e-child\" data-id=\"0a7207c\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-8deae0f 
exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"8deae0f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">This is data that is outdated or contains structural errors such as typos, inconsistent capitalization, and irregular naming conventions.<\/span><\/h5><h5><span style=\"color: #000080;\">Say you have a dataset containing student test scores, with some categorized as &#8220;Pass&#8221; or &#8220;Fail&#8221; and others categorized as &#8220;P&#8221; or &#8220;F.&#8221;<\/span><\/h5><h5><span style=\"color: #000080;\">Both sets of labels mean the same thing, but the naming convention is inconsistent, which leaves the data messy.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-5443\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"4\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-5443\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Misaligned data <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 
17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5443\" class=\"elementor-element elementor-element-241cb54 e-con-full e-flex e-con e-child\" data-id=\"241cb54\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-5443\" class=\"elementor-element elementor-element-a7966c1 e-con-full e-flex e-con e-child\" data-id=\"a7966c1\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-97d1442 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"97d1442\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">Misaligned data refers to the situation where data is placed in the wrong fields or columns in a dataset.<\/span><\/h5><h5><span style=\"color: #000080;\">For example, imagine a dataset that includes information about employees in a company, where the salary data is placed in the field intended for employee names.<\/span><\/h5><h5><span style=\"color: #000080;\">This type of error can be caused by various reasons, such as manual data entry errors, technical issues in data import or export, or formatting problems.<\/span><\/h5><div>\u00a0<\/div>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e92fc5d exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-heading\" data-id=\"e92fc5d\" data-element_type=\"widget\" data-e-type=\"widget\" 
data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">What are some key steps in the data-cleaning process?\n\n<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2d6b3c8 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"2d6b3c8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">We&#8217;ve established how important the data-cleaning stage is.<\/span><\/h5><h5><span style=\"color: #000080;\">Now let&#8217;s introduce some data-cleaning techniques! To clean your data, you might do some or all of the following:<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-19779bf exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-n-accordion\" data-id=\"19779bf\" data-element_type=\"widget\" data-e-type=\"widget\" data-settings=\"{&quot;default_state&quot;:&quot;expanded&quot;,&quot;max_items_expended&quot;:&quot;one&quot;,&quot;n_accordion_animation_duration&quot;:{&quot;unit&quot;:&quot;ms&quot;,&quot;size&quot;:400,&quot;sizes&quot;:[]}}\" data-widget_type=\"nested-accordion.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"e-n-accordion\" aria-label=\"Accordion. 
Open links with Enter or Space, close with Escape, and navigate with Arrow Keys\">\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-2670\" class=\"e-n-accordion-item\" open>\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"1\" tabindex=\"0\" aria-expanded=\"true\" aria-controls=\"e-n-accordion-item-2670\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\">  Delete Unnecessary Columns  <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2670\" class=\"elementor-element elementor-element-dae50e1 e-con-full e-flex e-con e-child\" data-id=\"dae50e1\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2670\" class=\"elementor-element elementor-element-661b394 e-con-full e-flex e-con e-child\" data-id=\"661b394\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-819a8a7 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"819a8a7\" 
data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\"><span style=\"color: #011c7c;\">Chances are, your dataset will contain some values that aren&#8217;t relevant to your analysis.<\/span><\/h5><h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\"><span style=\"color: #011c7c;\">For example, in an analysis of students&#8217; test scores compared to hours spent studying, things like student ID number and date of birth aren&#8217;t relevant.<\/span><\/h5><h5 tabindex=\"0\" data-element-id=\"ebookHeading4\" data-node-type=\"text\" data-magic=\"col-description\"><span style=\"color: #011c7c;\">You could simply delete the columns containing this data.<\/span><\/h5><div>\u00a0<\/div>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-2671\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"2\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2671\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Identify and remove duplicates.  
<\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2671\" class=\"elementor-element elementor-element-0eb10bd e-con-full e-flex e-con e-child\" data-id=\"0eb10bd\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2671\" class=\"elementor-element elementor-element-d047c0b e-con-full e-flex e-con e-child\" data-id=\"d047c0b\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-7a28594 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"7a28594\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">\u00a0Duplicate data tends to occur during the data collection phase, so it&#8217;s important to filter them out.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-2672\" 
class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"3\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2672\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Deal with missing data.  <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2672\" class=\"elementor-element elementor-element-65fd4b9 e-con-full e-flex e-con e-child\" data-id=\"65fd4b9\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2672\" class=\"elementor-element elementor-element-8c00c72 e-con-full e-flex e-con e-child\" data-id=\"8c00c72\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-a69e249 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"a69e249\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div 
class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">In the case of missing data, you can either delete the entire entry associated with it (i.e. delete the whole row which contains the empty cell), impute the missing value based on other data, or flag all missing data as such by entering &#8220;0&#8221; or &#8220;missing&#8221; in the respective cell.\u00a0<\/span><\/h5><h5><span style=\"color: #000080;\">Each method for handling missing data has implications for your analysis, so you&#8217;ll need to choose your approach carefully.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-2673\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"4\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2673\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Remove unwanted outliers.  
<\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2673\" class=\"elementor-element elementor-element-860c435 e-con-full e-flex e-con e-child\" data-id=\"860c435\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2673\" class=\"elementor-element elementor-element-977ffd6 e-con-full e-flex e-con e-child\" data-id=\"977ffd6\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-6ea39cf exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"6ea39cf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">Outliers are values that differ significantly from other values in your data.<\/span><\/h5><h5><span style=\"color: #000080;\">For example, if you see that most student test scores fall between 50 and 80, but that one student has scored a 2, this might be considered an 
outlier.\u00a0<\/span><\/h5><h5><span style=\"color: #000080;\">Outliers may be the result of an error, but that&#8217;s not always the case, so approach with caution when deciding whether or not to remove them.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-2674\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"5\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-2674\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Fix inconsistencies. <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewbox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2674\" class=\"elementor-element elementor-element-29ada08 e-con-full e-flex e-con e-child\" data-id=\"29ada08\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-2674\" class=\"elementor-element elementor-element-ac19184 e-con-full e-flex e-con e-child\" data-id=\"ac19184\" 
data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-9544d62 exad-sticky-section-no exad-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"9544d62\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<h5><span style=\"color: #000080;\">As already mentioned, inconsistencies in data include typos and irregular naming conventions.<\/span><\/h5><h5><span style=\"color: #000080;\">You can fix these manually (for example, by using the &#8220;Find and replace&#8221; function in Google Sheets or Microsoft Excel to locate one spelling or convention and replace it with another) or by creating a filter to isolate the affected rows and correct them together.<\/span><\/h5>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>","protected":false},"comment_status":"open","ping_status":"closed","template":"","class_list":["post-23670","lesson","type-lesson","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Lesson 1: What is data cleaning? - Certifeka-edu<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/\" \/>\n<meta property=\"og:locale\" content=\"ar_AR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Lesson 1: What is data cleaning? 
- Certifeka-edu\" \/>\n<meta property=\"og:description\" content=\"Lesson 1: Introduction to Data Analysis What is data cleaning? Data cleaning, also known as data cleansing, is the process of identifying and correcting errors, inconsistencies, and inaccuracies in datasets. It is an essential step in data analysis and data processing, as raw data often contains errors and inconsistencies that can lead to incorrect or [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/\" \/>\n<meta property=\"og:site_name\" content=\"Certifeka-edu\" \/>\n<meta property=\"article:modified_time\" content=\"2025-08-25T18:36:27+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u0648\u0642\u062a \u0627\u0644\u0642\u0631\u0627\u0621\u0629 \u0627\u0644\u0645\u064f\u0642\u062f\u0651\u0631\" \/>\n\t<meta name=\"twitter:data1\" content=\"6 \u062f\u0642\u0627\u0626\u0642\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":[\"WebPage\",\"webpage\"],\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/\",\"name\":\"Lesson 1: What is data cleaning? 
- Certifeka-edu\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/logos-png-01-296x57-1.png\",\"datePublished\":\"2025-08-11T18:19:42+00:00\",\"dateModified\":\"2025-08-25T18:36:27+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/#breadcrumb\"},\"inLanguage\":\"ar\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ar\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/#primaryimage\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/logos-png-01-296x57-1.png\",\"contentUrl\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/04\\\/logos-png-01-296x57-1.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/programs\\\/research-for-strategic-development-professional-certificate\\\/lessons\\\/lesson-1-what-is-data-cleaning\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"home\",\"item\":\"https:\\\/\\\/certifeka-edu.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Lesson
s\",\"item\":\"https:\\\/\\\/certifeka-edu.com\\\/lesson\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Lesson 1: What is data cleaning?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#website\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/\",\"name\":\"certifeka\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/certifeka-edu.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ar\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#organization\",\"name\":\"certifeka\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ar\",\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/cropped-certifeka-removebg-preview.png\",\"contentUrl\":\"https:\\\/\\\/certifeka-edu.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/cropped-certifeka-removebg-preview.png\",\"width\":366,\"height\":104,\"caption\":\"certifeka\"},\"image\":{\"@id\":\"https:\\\/\\\/certifeka-edu.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Lesson 1: What is data cleaning? - Certifeka-edu","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/","og_locale":"ar_AR","og_type":"article","og_title":"Lesson 1: What is data cleaning? 
- Certifeka-edu","og_description":"Lesson 1: Introduction to Data Analysis What is data cleaning? Data cleaning, also known as data cleansing, is the process of identifying and correcting errors, inconsistencies, and inaccuracies in datasets. It is an essential step in data analysis and data processing, as raw data often contains errors and inconsistencies that can lead to incorrect or [&hellip;]","og_url":"https:\/\/certifeka-edu.com\/ar\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/","og_site_name":"Certifeka-edu","article_modified_time":"2025-08-25T18:36:27+00:00","og_image":[{"url":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png","type":"","width":"","height":""}],"twitter_card":"summary_large_image","twitter_misc":{"\u0648\u0642\u062a \u0627\u0644\u0642\u0631\u0627\u0621\u0629 \u0627\u0644\u0645\u064f\u0642\u062f\u0651\u0631":"6 \u062f\u0642\u0627\u0626\u0642"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":["WebPage","webpage"],"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/","url":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/","name":"Lesson 1: What is data cleaning? 
- Certifeka-edu","isPartOf":{"@id":"https:\/\/certifeka-edu.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/#primaryimage"},"image":{"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/#primaryimage"},"thumbnailUrl":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png","datePublished":"2025-08-11T18:19:42+00:00","dateModified":"2025-08-25T18:36:27+00:00","breadcrumb":{"@id":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/#breadcrumb"},"inLanguage":"ar","potentialAction":[{"@type":"ReadAction","target":["https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/"]}]},{"@type":"ImageObject","inLanguage":"ar","@id":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/#primaryimage","url":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png","contentUrl":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/04\/logos-png-01-296x57-1.png"},{"@type":"BreadcrumbList","@id":"https:\/\/certifeka-edu.com\/programs\/research-for-strategic-development-professional-certificate\/lessons\/lesson-1-what-is-data-cleaning\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"home","item":"https:\/\/certifeka-edu.com\/"},{"@type":"ListItem","position":2,"name":"Lessons","item":"https:\/\/certifeka-edu.com\/lesson\/"},{"@type":"ListItem","position":3,"name":"Lesson 1: What is data 
cleaning?"}]},{"@type":"WebSite","@id":"https:\/\/certifeka-edu.com\/#website","url":"https:\/\/certifeka-edu.com\/","name":"certifeka","description":"","publisher":{"@id":"https:\/\/certifeka-edu.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/certifeka-edu.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ar"},{"@type":"Organization","@id":"https:\/\/certifeka-edu.com\/#organization","name":"certifeka","url":"https:\/\/certifeka-edu.com\/","logo":{"@type":"ImageObject","inLanguage":"ar","@id":"https:\/\/certifeka-edu.com\/#\/schema\/logo\/image\/","url":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/03\/cropped-certifeka-removebg-preview.png","contentUrl":"https:\/\/certifeka-edu.com\/wp-content\/uploads\/2025\/03\/cropped-certifeka-removebg-preview.png","width":366,"height":104,"caption":"certifeka"},"image":{"@id":"https:\/\/certifeka-edu.com\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/lesson\/23670","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/lesson"}],"about":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/types\/lesson"}],"replies":[{"embeddable":true,"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/comments?post=23670"}],"version-history":[{"count":0,"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/lesson\/23670\/revisions"}],"wp:attachment":[{"href":"https:\/\/certifeka-edu.com\/ar\/wp-json\/wp\/v2\/media?parent=23670"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}