online.hbs.edu Open in urlscan Pro
54.192.86.46  Public Scan

Submitted URL: https://info.online.hbs.edu/e3t/Btc/48+113/c2-rz04/VVTH0D91rJbLW83yKk33vvc_MW5yX2x94Dqmv7N9k7Mb93q3n_V1-WJV7CgBhfW6vtLs11lSD...
Effective URL: https://online.hbs.edu/blog/post/data-wrangling?utm_campaign=Topic%20to%20Program%20Lead%20Nurturing&utm_medium=email&_...
Submission: On January 18 via api from CH — Scanned from DE

Form analysis 4 forms found in the DOM

<form><span class="fieldset">
    <p><input type="checkbox" value="check" id="chkMain" checked="checked" class="legacy-group-status optanon-status-checkbox"><label for="chkMain">Active</label></p>
  </span></form>

GET /blog/

<form action="/blog/" method="get">
  <div class="form-container" data-action="/blog/" data-method="get">
    <div class="form-control-group-inline"><input type="text" class="field" name="search" id="search-mobile" value="" maxlength="250" aria-label="Search Box" style="padding-right:48px;"><button type="submit" class="xi-uc btn-submit btn"
        style="background-color:#a41034;width:38px;height:38px;padding:8px 0px;" aria-label="Search Button">
        <svg xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" style="margin-left:-4px;transform:scale(0.65) rotate(-45deg);fill:white;" viewBox="0 0 177 100" version="1.1" focusable="false">
          <path
            d="M137.686,11.342 C137.686,-15.909 115.595,-38 88.344,-38 C61.09,-38 39,-15.909 39,11.342 C39,36.724 58.168,57.607 82.818,60.355 L82.818,72.838 C79.304,74.521 76.898,77.714 76.898,81.419 L76.898,128.148 C76.898,133.588 82.023,138 88.345,138 C94.667,138 99.789,133.589 99.789,128.148 L99.789,81.419 C99.789,77.711 97.382,74.521 93.869,72.838 L93.869,60.354 C118.515,57.606 137.686,36.726 137.686,11.342 Z M56.032,11.225 C56.032,-6.759 70.606,-21.338 88.589,-21.338 C106.572,-21.338 121.142,-6.759 121.142,11.225 C121.142,29.205 106.572,43.78 88.589,43.78 C70.606,43.78 56.032,29.205 56.032,11.225 Z"
            transform="translate(88.343000, 50.000000) rotate(90.000000) translate(-88.343000, -50.000000) "></path>
        </svg></button></div>
  </div>
</form>

GET /blog/

<form action="/blog/" method="get">
  <div class="form-container" data-action="/blog/" data-method="get">
    <div class="form-control-group-inline"><input type="text" class="field" name="search" id="search-desktop" value="" maxlength="250" aria-label="Search Box" style="padding-right:48px;"><button type="submit" class="xi-uc btn-submit btn"
        style="background-color:#a41034;width:38px;height:38px;padding:8px 0px;" aria-label="Search Button">
        <svg xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" style="margin-left:-4px;transform:scale(0.65) rotate(-45deg);fill:white;" viewBox="0 0 177 100" version="1.1" focusable="false">
          <path
            d="M137.686,11.342 C137.686,-15.909 115.595,-38 88.344,-38 C61.09,-38 39,-15.909 39,11.342 C39,36.724 58.168,57.607 82.818,60.355 L82.818,72.838 C79.304,74.521 76.898,77.714 76.898,81.419 L76.898,128.148 C76.898,133.588 82.023,138 88.345,138 C94.667,138 99.789,133.589 99.789,128.148 L99.789,81.419 C99.789,77.711 97.382,74.521 93.869,72.838 L93.869,60.354 C118.515,57.606 137.686,36.726 137.686,11.342 Z M56.032,11.225 C56.032,-6.759 70.606,-21.338 88.589,-21.338 C106.572,-21.338 121.142,-6.759 121.142,11.225 C121.142,29.205 106.572,43.78 88.589,43.78 C70.606,43.78 56.032,29.205 56.032,11.225 Z"
            transform="translate(88.343000, 50.000000) rotate(90.000000) translate(-88.343000, -50.000000) "></path>
        </svg></button></div>
  </div>
</form>

POST https://forms.hsforms.com/submissions/v3/public/submit/formsnext/multipart/467832/038fed31-bf48-4d40-8bfd-4c374813c25a

<form novalidate="" accept-charset="UTF-8" action="https://forms.hsforms.com/submissions/v3/public/submit/formsnext/multipart/467832/038fed31-bf48-4d40-8bfd-4c374813c25a" enctype="multipart/form-data" id="hsForm_038fed31-bf48-4d40-8bfd-4c374813c25a"
  method="POST" class="hs-form stacked hs-form-private hsForm_038fed31-bf48-4d40-8bfd-4c374813c25a hs-form-038fed31-bf48-4d40-8bfd-4c374813c25a hs-form-038fed31-bf48-4d40-8bfd-4c374813c25a_bf8166b2-83f3-4ffc-ae14-9f33a4e3c115"
  data-form-id="038fed31-bf48-4d40-8bfd-4c374813c25a" data-portal-id="467832" target="target_iframe_038fed31-bf48-4d40-8bfd-4c374813c25a" data-reactid=".hbspt-forms-0">
  <div class="hs_email hs-email hs-fieldtype-text field hs-form-field" data-reactid=".hbspt-forms-0.1:$0"><label id="label-email-038fed31-bf48-4d40-8bfd-4c374813c25a" class="" placeholder="Enter your Email"
      for="email-038fed31-bf48-4d40-8bfd-4c374813c25a" data-reactid=".hbspt-forms-0.1:$0.0"><span data-reactid=".hbspt-forms-0.1:$0.0.0">Email</span><span class="hs-form-required" data-reactid=".hbspt-forms-0.1:$0.0.1">*</span></label>
    <legend class="hs-field-desc" style="display:none;" data-reactid=".hbspt-forms-0.1:$0.1"></legend>
    <div class="input" data-reactid=".hbspt-forms-0.1:$0.$email"><input id="email-038fed31-bf48-4d40-8bfd-4c374813c25a" class="hs-input" type="email" name="email" required="" placeholder="" value="" autocomplete="email"
        data-reactid=".hbspt-forms-0.1:$0.$email.0" inputmode="email"></div>
  </div>
  <div data-reactid=".hbspt-forms-0.1:$1">
    <div class="hs-richtext hs-main-font-element" data-reactid=".hbspt-forms-0.1:$1.0">
      <p>Using assistive technology? Get <a href="https://info.online.hbs.edu/form-guidance-rmi" target="_blank" rel="noopener">more details on using this form</a>.</p>
    </div>
  </div>
  <div class="hs_external_blog_subscription_status hs-external_blog_subscription_status hs-fieldtype-booleancheckbox field hs-form-field" style="display:none;" data-reactid=".hbspt-forms-0.1:$2">
    <legend class="hs-field-desc" style="display:none;" data-reactid=".hbspt-forms-0.1:$2.1"></legend>
    <div class="input" data-reactid=".hbspt-forms-0.1:$2.$external_blog_subscription_status"><input name="external_blog_subscription_status" class="hs-input" type="hidden" value=""
        data-reactid=".hbspt-forms-0.1:$2.$external_blog_subscription_status.0"></div>
  </div>
  <div class="hs_lifecyclestage hs-lifecyclestage hs-fieldtype-radio field hs-form-field" style="display:none;" data-reactid=".hbspt-forms-0.1:$3"><label id="label-lifecyclestage-038fed31-bf48-4d40-8bfd-4c374813c25a" class=""
      placeholder="Enter your Lifecycle Stage" for="lifecyclestage-038fed31-bf48-4d40-8bfd-4c374813c25a" data-reactid=".hbspt-forms-0.1:$3.0"><span data-reactid=".hbspt-forms-0.1:$3.0.0">Lifecycle Stage</span></label>
    <legend class="hs-field-desc" style="display:none;" data-reactid=".hbspt-forms-0.1:$3.1"></legend>
    <div class="input" data-reactid=".hbspt-forms-0.1:$3.$lifecyclestage"><input name="lifecyclestage" class="hs-input" type="hidden" value="subscriber" data-reactid=".hbspt-forms-0.1:$3.$lifecyclestage.0"></div>
  </div><noscript data-reactid=".hbspt-forms-0.2"></noscript>
  <div class="hs_submit hs-submit" data-reactid=".hbspt-forms-0.5">
    <div class="hs-field-desc" style="display:none;" data-reactid=".hbspt-forms-0.5.0"></div>
    <div class="actions" data-reactid=".hbspt-forms-0.5.1"><input type="submit" value="Submit" class="hs-button primary large" data-reactid=".hbspt-forms-0.5.1.0"></div>
  </div><noscript data-reactid=".hbspt-forms-0.6"></noscript><input name="hs_context" type="hidden"
    value="{&quot;rumScriptExecuteTime&quot;:4165.799999237061,&quot;rumServiceResponseTime&quot;:4414.89999961853,&quot;rumFormRenderTime&quot;:2.5,&quot;rumTotalRenderTime&quot;:4417.60000038147,&quot;rumTotalRequestTime&quot;:247.39999961853027,&quot;embedAtTimestamp&quot;:&quot;1642490265385&quot;,&quot;formDefinitionUpdatedAt&quot;:&quot;1585253661228&quot;,&quot;pageUrl&quot;:&quot;https://online.hbs.edu/blog/post/data-wrangling?utm_campaign=Topic%20to%20Program%20Lead%20Nurturing&amp;utm_medium=email&amp;_hsmi=100462064&amp;_hsenc=p2ANqtz-9o9JrNNMoJg_35EU_Y-1dBSTjoEVIGKP5EFjeujouzMu7YdkxaKySv4QhakiMrzR_kwJ7nbXO6Rm0nMIuJO1A6t3XKbQ&amp;utm_content=100462064&amp;utm_source=hs_automation&quot;,&quot;pageTitle&quot;:&quot;Data Wrangling: What It Is &amp; Why It’s Important&quot;,&quot;source&quot;:&quot;FormsNext-static-5.432&quot;,&quot;sourceName&quot;:&quot;FormsNext&quot;,&quot;sourceVersion&quot;:&quot;5.432&quot;,&quot;sourceVersionMajor&quot;:&quot;5&quot;,&quot;sourceVersionMinor&quot;:&quot;432&quot;,&quot;timestamp&quot;:1642490265388,&quot;userAgent&quot;:&quot;Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/97.0.4692.71 Safari/537.36&quot;,&quot;originalEmbedContext&quot;:{&quot;portalId&quot;:&quot;467832&quot;,&quot;formId&quot;:&quot;038fed31-bf48-4d40-8bfd-4c374813c25a&quot;,&quot;target&quot;:&quot;#blog-signup-form-1&quot;,&quot;redirectUrl&quot;:&quot;http://hbx.hbs.edu/blog-thanks&quot;},&quot;redirectUrl&quot;:&quot;http://hbx.hbs.edu/blog-thanks&quot;,&quot;urlParams&quot;:{&quot;utm_campaign&quot;:&quot;Topic to Program Lead Nurturing&quot;,&quot;utm_medium&quot;:&quot;email&quot;,&quot;_hsmi&quot;:&quot;100462064&quot;,&quot;_hsenc&quot;:&quot;p2ANqtz-9o9JrNNMoJg_35EU_Y-1dBSTjoEVIGKP5EFjeujouzMu7YdkxaKySv4QhakiMrzR_kwJ7nbXO6Rm0nMIuJO1A6t3XKbQ&quot;,&quot;utm_content&quot;:&quot;100462064&quot;,&quot;utm_source&quot;:&quot;hs_automation&quot;},&quot;renderedFieldsIds&quot;:[&quot;email&quot;],&quot;formTarget&quot;:&quot;#blog-signup-form-1&quot;,&quot;correlationId&quot;:&quot;20c84b57-8523-440d-9efb-3e10a7db11c4&quot;,&quot;captchaStatus&quot;:&quot;NOT_APPLICABLE&quot;}"
    data-reactid=".hbspt-forms-0.7"><iframe name="target_iframe_038fed31-bf48-4d40-8bfd-4c374813c25a" style="display:none;" data-reactid=".hbspt-forms-0.8"></iframe>
</form>

Text Content

PLEASE ACCEPT HBS WEBSITE COOKIES:

This site uses cookies and similar technologies to provide online services,
enhance the performance and functionality of our services, analyze the use of
our website, and assist with our advertising and marketing efforts.


Close
Accept Cookies
Cookie Settings


 * Your Privacy

 * Strictly Necessary Cookies

 * Performance Cookies

 * Functional Cookies

 * Targeting Cookies

 * More Information

Privacy Preference Center

Active

Always Active



Save Settings

Allow All

HBS Online

 * Courses
   * Business Essentials
   * Leadership & Management
   * Analytics
   * Entrepreneurship & Innovation
   * Strategy
   * Finance & Accounting
   * Business in Society
 * For Organizations
 * Insights
 * More Info


 * About
 * Media Coverage
 * Founding Donors
 * Leadership Team
 * Careers
 * My Courses
 * My Account
 * Apply Now

HBS Home

 * About HBS
 * Academic Programs
 * Alumni
 * Faculty & Research


 * Baker Library
 * Giving
 * Harvard Business Review
 * Initiatives
 * News
 * Recruit


 * Map / Directions

Search






Skip to Main Content

 * Courses
   Open Courses Mega Menu
    * Business Essentials
      * Credential of Readiness (CORe)
      * Business Analytics
      * Economics for Managers
      * Financial Accounting
      
      
    * Leadership & Management
      * Leadership Principles
      * Management Essentials
      * Negotiation Mastery
      * Organizational Leadership
      * Strategy Execution
      
      
    * Analytics
      * Business Analytics
      * Data Science Principles
      * Data Science for Business
      * Big Data for Social Good
      * Data Privacy and Technology
      * Digital Health
   
    * Entrepreneurship & Innovation
      * Entrepreneurship Essentials
      * Disruptive Strategy
      * Negotiation Mastery
      * Design Thinking and Innovation
      
      
    * Strategy
      * Strategy Execution
      * Economics for Managers
      * Disruptive Strategy
      * Global Business
      * Sustainable Business Strategy
      * Health Care Economics
   
    * Finance & Accounting
      * Financial Accounting
      * Leading with Finance
      * Alternative Investments
      * Financial Analysis & Valuation for Lawyers
      
      
    * Business in Society
      * Sustainable Business Strategy
      * Global Business
      * Health Care Economics
      * Big Data for Social Good
      * Digital Health
      
      
    * All Courses

 * For Organizations
   Open For Organizations Mega Menu
    * Corporate Learning
      Help your employees master essential business concepts, improve
      effectiveness, and expand leadership capabilities.
   
    * Academic Solutions
      Integrate HBS Online courses into your curriculum to support programs and
      create unique educational opportunities.
   
    * Need Help?
      * Frequently Asked Questions
      * Contact Us

 * Insights
   Open Insights Mega Menu
    * Business Insights Blog
      * Career Development
      * Communication
      * Decision-Making
      * Earning Your MBA
      * Entrepreneurship & Innovation
      * Finance
      * Leadership
      * Management
      * Negotiation
      * Strategy
      
      
    * All Topics
   
    * Free Leadership Lesson
      
      Become a resilient leader in these turbulent times.
   
    * Free Guide
      
      Learn how to advance your career with essential business skills.

 * More Info
   Open More Info Mega Menu
    * Learning Experience
      Master real-world business skills with our immersive platform and engaged
      community.
      
      
    * Certificates, Credentials, & Credits
      Learn how completing courses can boost your resume and move your career
      forward.
   
    * Learning Tracks
      Take your career to the next level with this specialization.
      
      
    * Financing & Policies
      * Employer Reimbursement
      * Payment & Financial Aid
      * Policies
   
    * Connect
      * Student Stories
      * Community
      
      
    * Need Help?
      * Frequently Asked Questions
      * Request Information
      
      
    * Apply Now

Login
My Courses
Access your courses and engage with your peers

My Account
Manage your account, applications, and payments.

HBS Home

 * About HBS
 * Academic Programs
 * Alumni
 * Faculty & Research


 * Baker Library
 * Giving
 * Harvard Business Review
 * Initiatives
 * News
 * Recruit


 * Map / Directions

HBS Online
 * Courses
 * Business Essentials
 * Leadership & Management
 * Analytics
 * Entrepreneurship & Innovation
 * Strategy
 * Finance & Accounting
 * Business in Society
 * For Organizations
 * Insights
 * More Info

 * About
 * Media Coverage
 * Founding Donors
 * Leadership Team
 * Careers
 * My Courses
 * My Account
 * Apply Now


Skip to Main Content

 * Courses
   Open Courses Mega Menu
    * Business Essentials
      * Credential of Readiness (CORe)
      * Business Analytics
      * Economics for Managers
      * Financial Accounting
      
      
    * Leadership & Management
      * Leadership Principles
      * Management Essentials
      * Negotiation Mastery
      * Organizational Leadership
      * Strategy Execution
      
      
    * Analytics
      * Business Analytics
      * Data Science Principles
      * Data Science for Business
      * Big Data for Social Good
      * Data Privacy and Technology
      * Digital Health
   
    * Entrepreneurship & Innovation
      * Entrepreneurship Essentials
      * Disruptive Strategy
      * Negotiation Mastery
      * Design Thinking and Innovation
      
      
    * Strategy
      * Strategy Execution
      * Economics for Managers
      * Disruptive Strategy
      * Global Business
      * Sustainable Business Strategy
      * Health Care Economics
   
    * Finance & Accounting
      * Financial Accounting
      * Leading with Finance
      * Alternative Investments
      * Financial Analysis & Valuation for Lawyers
      
      
    * Business in Society
      * Sustainable Business Strategy
      * Global Business
      * Health Care Economics
      * Big Data for Social Good
      * Digital Health
      
      
    * All Courses

 * For Organizations
   Open For Organizations Mega Menu
    * Corporate Learning
      Help your employees master essential business concepts, improve
      effectiveness, and expand leadership capabilities.
   
    * Academic Solutions
      Integrate HBS Online courses into your curriculum to support programs and
      create unique educational opportunities.
   
    * Need Help?
      * Frequently Asked Questions
      * Contact Us

 * Insights
   Open Insights Mega Menu
    * Business Insights Blog
      * Career Development
      * Communication
      * Decision-Making
      * Earning Your MBA
      * Entrepreneurship & Innovation
      * Finance
      * Leadership
      * Management
      * Negotiation
      * Strategy
      
      
    * All Topics
   
    * Free Leadership Lesson
      
      Become a resilient leader in these turbulent times.
   
    * Free Guide
      
      Learn how to advance your career with essential business skills.

 * More Info
   Open More Info Mega Menu
    * Learning Experience
      Master real-world business skills with our immersive platform and engaged
      community.
      
      
    * Certificates, Credentials, & Credits
      Learn how completing courses can boost your resume and move your career
      forward.
   
    * Learning Tracks
      Take your career to the next level with this specialization.
      
      
    * Financing & Policies
      * Employer Reimbursement
      * Payment & Financial Aid
      * Policies
   
    * Connect
      * Student Stories
      * Community
      
      
    * Need Help?
      * Frequently Asked Questions
      * Request Information
      
      
    * Apply Now

Login
My Courses
Access your courses and engage with your peers

My Account
Manage your account, applications, and payments.

HBS Home

 * About HBS
 * Academic Programs
 * Alumni
 * Faculty & Research


 * Baker Library
 * Giving
 * Harvard Business Review
 * Initiatives
 * News
 * Recruit


 * Map / Directions

HBS Online
 * Courses
 * Business Essentials
 * Leadership & Management
 * Analytics
 * Entrepreneurship & Innovation
 * Strategy
 * Finance & Accounting
 * Business in Society
 * For Organizations
 * Insights
 * More Info

 * About
 * Media Coverage
 * Founding Donors
 * Leadership Team
 * Careers
 * My Courses
 * My Account
 * Apply Now


 * …→
 * Harvard Business School→
 * HBS Online→


BUSINESS INSIGHTS

Harvard Business School Online's Business Insights Blog provides the career
insights you need to achieve your goals and gain confidence in your business
skills.


 
Filter Results Arrow Down Arrow Up

TOPICS

TOPICS

 * Accounting
 * Analytics
 * Business Essentials
 * Business in Society
 * Career Development
 * Communication
 * Community
 * ConneXt
 * Decision-Making
 * Earning Your MBA
 * Entrepreneurship & Innovation
 * Finance
 * Leadership
 * Management
 * Marketing
 * Negotiation
 * News & Events
 * Productivity
 * Staff Spotlight
 * Strategy
 * Student Profiles
 * Technology
 * Work-Life Balance



COURSES

COURSES

 * Alternative Investments
 * Big Data for Social Good
 * Business Analytics
 * CORe
 * Data Privacy and Technology
 * Data Science Principles
 * Data Science for Business
 * Design Thinking and Innovation
 * Digital Health
 * Disruptive Strategy
 * Economics for Managers
 * Entrepreneurship Essentials
 * Financial Accounting
 * Financial Analysis and Valuation for Lawyers
 * Global Business
 * Health Care Economics
 * Leadership Principles
 * Leading with Finance
 * Management Essentials
 * Negotiation Mastery
 * Organizational Leadership
 * Strategy Execution
 * Sustainable Business Strategy



Subscribe to the Blog


RSS feed

TOPICS

TOPICS

 * Accounting
 * Analytics
 * Business Essentials
 * Business in Society
 * Career Development
 * Communication
 * Community
 * ConneXt
 * Decision-Making
 * Earning Your MBA
 * Entrepreneurship & Innovation
 * Finance
 * Leadership
 * Management
 * Marketing
 * Negotiation
 * News & Events
 * Productivity
 * Staff Spotlight
 * Strategy
 * Student Profiles
 * Technology
 * Work-Life Balance



COURSES

COURSES

 * Alternative Investments
 * Big Data for Social Good
 * Business Analytics
 * CORe
 * Data Privacy and Technology
 * Data Science Principles
 * Data Science for Business
 * Design Thinking and Innovation
 * Digital Health
 * Disruptive Strategy
 * Economics for Managers
 * Entrepreneurship Essentials
 * Financial Accounting
 * Financial Analysis and Valuation for Lawyers
 * Global Business
 * Health Care Economics
 * Leadership Principles
 * Leading with Finance
 * Management Essentials
 * Negotiation Mastery
 * Organizational Leadership
 * Strategy Execution
 * Sustainable Business Strategy



Subscribe to the Blog

Email*


Using assistive technology? Get more details on using this form.


Lifecycle Stage


RSS feed


DATA WRANGLING: WHAT IT IS & WHY IT’S IMPORTANT


 * 19 Jan 2021

Tim Stobierski Author Contributors
tag
 * Analytics
 * Data Science for Business


 * Email
 * Print
 * Share
    * Facebook
    * LinkedIn
    * Twitter
    * Email



Businesses have long relied on professionals with data science and analytical
skills to understand and leverage information at their disposal. With the
proliferation of data, due to the development of smart devices and other
technological advancements, this need has accelerated.

It’s impossible to choose a single data science skill that’s most important for
business professionals. One thing that's certain, however, is that insights are
only as good as the data that informs them. This means it’s vital for
organizations to employ individuals who understand what clean data looks like
and how to shape raw data into usable forms. This is where data wrangling comes
into play.

Below is an overview of what data wrangling is, its key steps, and why it’s
crucial for business.

--------------------------------------------------------------------------------

Free E-Book: A Beginner's Guide to Data & Analytics

Access your free e-book today.

DOWNLOAD NOW

--------------------------------------------------------------------------------



WHAT IS DATA WRANGLING?

Data wrangling—also called data cleaning, data remediation, or data
munging—refers to a variety of processes designed to transform raw data into
more readily used formats. The exact methods differ from project to project
depending on the data you’re leveraging and the goal you’re trying to achieve.

Some examples of data wrangling include:

 * Merging multiple data sources into a single dataset for analysis
 * Identifying gaps in data (for example, empty cells in a spreadsheet) and
   either filling or deleting them
 * Deleting data that’s either unnecessary or irrelevant to the project you’re
   working on
 * Identifying extreme outliers in data and either explaining the discrepancies
   or removing them so that analysis can take place

Data wrangling can be a manual or automated process. In scenarios where datasets
are exceptionally large, automated data cleaning becomes a necessity. In
organizations that employ a full data team, a data scientist or other team
member is typically responsible for data wrangling. In smaller organizations,
non-data professionals are often responsible for cleaning their data before
leveraging it.


DATA WRANGLING STEPS

Each data project requires a unique approach to ensure its final dataset is
reliable and accessible. That being said, several processes typically inform the
approach. These are commonly referred to as data wrangling steps or activities.


1. DISCOVERY



Discovery refers to the process of familiarizing yourself with data so you can
conceptualize how you might use it. You can liken it to looking in your
refrigerator before cooking a meal to see what ingredients you have at your
disposal.

During discovery, you may identify trends or patterns in the data, along with
obvious issues, such as missing or incomplete values that need to be addressed.
This is an important step, as it will inform every activity that comes
afterward.


2. STRUCTURING



Raw data is typically unusable in its raw state because it’s either incomplete
or misformatted for its intended application. Data structuring is the process of
taking raw data and transforming it to be more readily leveraged. The form your
data takes will depend on the analytical model you use to interpret it.


3. CLEANING



Data cleaning is the process of removing inherent errors in data that might
distort your analysis or render it less valuable. Cleaning can come in different
forms, including deleting empty cells or rows, removing outliers, and
standardizing inputs. The goal of data cleaning is to ensure there are no errors
(or as few as possible) that could influence your final analysis.


4. ENRICHING



Once you understand your existing data and have transformed it into a more
usable state, you must determine whether you have all of the data necessary for
the project at hand. If not, you may choose to enrich or augment your data by
incorporating values from other datasets. For this reason, it’s important to
understand what other data is available for use.

If you decide that enrichment is necessary, you need to repeat the steps above
for any new data.


5. VALIDATING



Data validation refers to the process of verifying that your data is both
consistent and of a high enough quality. During validation, you may discover
issues you need to resolve or conclude that your data is ready to be analyzed.
Validation is typically achieved through various automated processes and
requires programming.


6. PUBLISHING



Once your data has been validated, you can publish it. This involves making it
available to others within your organization for analysis. The format you use to
share the information—such as a written report or electronic file—will depend on
your data and the organization’s goals.





THE IMPORTANCE OF DATA WRANGLING

Any analyses a business performs will ultimately be constrained by the data that
informs them. If data is incomplete, unreliable, or faulty, then analyses will
be too—diminishing the value of any insights gleaned.

Data wrangling seeks to remove that risk by ensuring data is in a reliable state
before it’s analyzed and leveraged. This makes it a critical part of the
analytical process.

It’s important to note that data wrangling can be time-consuming and taxing on
resources, particularly when done manually. This is why many organizations
institute policies and best practices that help employees streamline the data
cleanup process—for example, requiring that data include certain information or
be in a specific format before it’s uploaded to a database.

For this reason, it’s vital to understand the steps of the data wrangling
process and the negative outcomes associated with incorrect or faulty data.

Are you interested in improving your data science and analytical skills? Learn
more about Data Science for Business and our other online analytics courses, and
discover how you can use data to generate insights and tackle business
decisions.




ABOUT THE AUTHOR

Tim Stobierski is a marketing specialist and contributing writer for Harvard
Business School Online.




 
All FAQs


TOP FAQS




HOW ARE HBS ONLINE COURSES DELIVERED?

+–

We offer self-paced programs (with weekly deadlines) on the HBS Online course
platform.

Our platform features short, highly produced videos of HBS faculty and guest
business experts, interactive graphs and exercises, cold calls to keep you
engaged, and opportunities to contribute to a vibrant online community.



DO I NEED TO COME TO CAMPUS TO PARTICIPATE IN HBS ONLINE PROGRAMS?

+–

No, all of our programs are 100 percent online, and available to participants
regardless of their location.



HOW DO I ENROLL IN A COURSE?

+–

All programs require the completion of a brief application. The applications
vary slightly from program to program, but all ask for some personal background
information. You can apply for and enroll in programs here. If you are new to
HBS Online, you will be required to set up an account before starting an
application for the program of your choice.

Our easy online application is free, and no special documentation is required.
All applicants must be at least 18 years of age, proficient in English, and
committed to learning and engaging with fellow participants throughout the
program.

After submitting your application, you should receive an email confirmation from
HBS Online. If you do not receive this email, please check your junk email
folders and double-check your account to make sure the application was
successfully submitted.

Updates to your application and enrollment status will be shown on your
Dashboard. We confirm enrollment eligibility within one week of your
application.



DOES HARVARD BUSINESS SCHOOL ONLINE OFFER AN ONLINE MBA?

+–

No, Harvard Business School Online offers business certificate programs.



WHAT ARE MY PAYMENT OPTIONS?

+–

We accept payments via credit card, Western Union, and (when available) bank
loan. Some candidates may qualify for scholarships or financial aid, which will
be credited against the Program Fee once eligibility is determined. Please refer
to the Payment & Financial Aid page for further information.

We also allow you to split your payment across 2 separate credit card
transactions or send a payment link email to another person on your behalf. If
splitting your payment into 2 transactions, a minimum payment of $350 is
required for the first transaction.

In all cases, net Program Fees must be paid in full (in US Dollars) to complete
registration.



WHAT ARE THE POLICIES FOR REFUNDS AND DEFERRALS?

+–

After enrolling in a program, you may request a withdrawal with refund (minus a
$100 nonrefundable enrollment fee) up until 24 hours after the start of your
program. Please review the Program Policies page for more details on refunds and
deferrals. If your employer has contracted with HBS Online for participation in
a program, or if you elect to enroll in the undergraduate credit option of the
Credential of Readiness (CORe) program, note that policies for these options may
differ.


 





SIGN UP FOR NEWS & ANNOUNCEMENTS




 * 
 * 
 * 
 * 
 * 


SUBJECT AREAS


 * Business Essentials
 * Leadership & Management
 * Analytics
 * Entrepreneurship & Innovation
 * Strategy
 * Finance & Accounting
 * Business & Society


QUICK LINKS


 * FAQs
 * Contact Us
 * Request Info
 * Apply Now


ABOUT


 * About Us
 * Media Coverage
 * Founding Donors
 * Leadership Team
 * Careers @ HBS Online


LEGAL


 * Legal
 * Policies


Copyright © President & Fellows of Harvard College
 * Site Map
 * Trademark Notice
 * Digital Accessibility


PDF




×