www.kdnuggets.com Open in urlscan Pro
2606:4700:20::ac43:44b2  Public Scan

Submitted URL: https://trk.klclick2.com/ls/click?upn=fIFn-2FSottIh5HSxkTGOh8S75Ut7ZBO9-2FhCILYeFoKWxQD713TPP0moju5aD2AcC3f1bBdCe8pjvYT4F...
Effective URL: https://www.kdnuggets.com/2022/04/python-libraries-data-scientists-know-2022.html?utm_source=Klaviyo&utm_medium=email&utm_...
Submission: On May 13 via api from US — Scanned from DE



PYTHON LIBRARIES DATA SCIENTISTS SHOULD KNOW IN 2022

Let’s have a look at the Python libraries that every data scientist should know
in 2022, to maintain and improve their coding journey.

By Nisha Arya, KDnuggets on April 11, 2022 in Python


--------------------------------------------------------------------------------


As more people enter the tech world to pursue roles such as Data Scientist,
Data Analyst, and Machine Learning Engineer, the Python programming language
becomes more popular. Thanks to its simple syntax, Python is known as one of
the most accessible programming languages available.

As Data Science grows in popularity, new libraries keep being released to help
solve the challenges faced in the field. Learning the ins and outs of every
library can be overwhelming; however, some are vital to our learning.

Below are Python libraries that every Data Scientist should know in 2022, to
maintain and improve their coding journey.

 


PANDAS

 




Source: Wikipedia

 



Pandas was created by Wes McKinney in 2008 as a Python library for data
manipulation and analysis. McKinney built Pandas out of his need for a
powerful and flexible analysis tool.



Pandas can deal with:

 * Handling missing data (represented as NaN)
 * Flexible reshaping and pivoting of datasets
 * Indexing, manipulation, renaming, merging, and joining of datasets
 * Time series-specific functionality
 * and much more
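
The capabilities listed above can be sketched in a few lines. This is a minimal illustration, not a recommended workflow: the `sales` and `regions` tables, their column names, and all values are made up for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical sales records; names and values are invented for illustration.
sales = pd.DataFrame({
    "store": ["A", "A", "B", "B"],
    "month": ["Jan", "Feb", "Jan", "Feb"],
    "revenue": [100.0, np.nan, 80.0, 120.0],
})

# Missing data: replace the NaN revenue with 0.
sales["revenue"] = sales["revenue"].fillna(0.0)

# Reshaping: one row per store, one column per month.
by_month = sales.pivot(index="store", columns="month", values="revenue")

# Merging and renaming: attach a second (hypothetical) table, rename a column.
regions = pd.DataFrame({"store": ["A", "B"], "area": ["North", "South"]})
merged = sales.merge(regions, on="store").rename(columns={"area": "region"})

# Time series: aggregate 59 daily values (Jan 1 to Feb 28) into monthly totals.
daily = pd.Series(1.0, index=pd.date_range("2022-01-01", periods=59, freq="D"))
monthly_totals = daily.resample("MS").sum()

print(by_month)
print(merged)
print(monthly_totals)
```

Filling missing values with 0 is only one possible choice here; depending on the data, dropping the rows or interpolating may be more appropriate.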

Core Task: Data Manipulation and Analysis

How to install Pandas: Pandas Installation 

pip install pandas


 

Get the Book: Python for Data Analysis by Wes McKinney

 


NUMPY

 



Source: Wikipedia

 



NumPy is another Python library, used for mathematical functions. It is
popular for processing multidimensional array objects and various derived
objects (such as masked arrays and matrices), and is widely used in machine
learning computations. The library includes linear algebra, Fourier transform,
and matrix calculation functions.

NumPy can deal with:

 * Array operations such as adding, multiplying, slicing, sorting, and indexing
 * Working with linear algebra
 * Basic slicing and advanced indexing of NumPy arrays
 * Adding, removing, and sorting elements
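
As a rough sketch of the operations listed above, the following example runs each of them on a small array. The arrays and variable names are arbitrary and chosen only for illustration.

```python
import numpy as np

a = np.array([[3, 1, 2],
              [6, 5, 4]])

doubled = a * 2                    # element-wise multiplication
shifted = a + 10                   # element-wise addition (scalar is broadcast)
row_sorted = np.sort(a, axis=1)    # sort the values within each row
first_col = a[:, 0]                # basic slicing: first column
big = a[a > 3]                     # advanced (boolean) indexing

# Adding and removing elements returns new arrays; `a` itself is unchanged.
appended = np.append(a, [[7, 8, 9]], axis=0)   # add a row
trimmed = np.delete(a, 1, axis=1)              # remove the middle column

# Linear algebra: solve the system m @ x = rhs.
m = np.array([[2.0, 0.0],
              [0.0, 4.0]])
rhs = np.array([2.0, 8.0])
x = np.linalg.solve(m, rhs)

print(row_sorted, big, trimmed, x, sep="\n")
```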

Core Task: Processing arrays, using mathematical functions

How to install NumPy: NumPy Installation

pip install numpy


 


SCIPY

 



Source: SciPy
 



SciPy stands for Scientific Python. It is a free and open-source Python
library: a collection of mathematical algorithms and functions built mainly on
top of NumPy.

SciPy:

 * Can manipulate and visualize data
 * Contains a variety of sub-packages that help to solve the most common
   challenges and problems related to scientific computation
 * Can deal with linear algebra, integration, ordinary differential equations,
   calculus, and signal processing
 * Is easy to use and understand, and computes quickly
 * Operates on NumPy arrays
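
To make the sub-packages above more concrete, here is a small sketch that touches integration, linear algebra, and an ordinary differential equation. The specific problems (the integrand, the 2x2 system, and the decay equation) are toy examples picked for illustration.

```python
import numpy as np
from scipy import integrate, linalg

# Integration: the integral of x^2 from 0 to 1 is 1/3.
area, abs_err = integrate.quad(lambda x: x ** 2, 0.0, 1.0)

# Linear algebra on NumPy arrays: solve 3x + y = 9 and x + 2y = 8.
A = np.array([[3.0, 1.0],
              [1.0, 2.0]])
b = np.array([9.0, 8.0])
solution = linalg.solve(A, b)

# Ordinary differential equation: dy/dt = -y with y(0) = 1,
# whose exact solution is y(t) = exp(-t).
ode = integrate.solve_ivp(lambda t, y: -y, (0.0, 1.0), [1.0],
                          rtol=1e-8, atol=1e-10)
y_end = ode.y[0, -1]

print(area, solution, y_end)
```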

Core Task: Solve scientific and mathematical problems

How to install SciPy: SciPy Installation

pip install scipy



conda install scipy


 

 


MATPLOTLIB

 



Source: GitHub
 



Matplotlib is a cross-platform data visualization and graphical plotting
library for Python that works with NumPy arrays. Used in conjunction with
NumPy, it provides an effective environment that is an open-source alternative
to MATLAB.

Matplotlib can:

 * Create quality plots of data.
 * Create Line charts, Scatter charts, Bar charts and histograms, Pie charts,
   Stem plots, Spectrograms
 * Make interactive figures that can zoom in and out, pan, and update.
 * Customize the style and layout of the visualisation.
 * Export to different file formats
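
A short sketch of the points above: it draws a line chart and a histogram, customizes titles and layout, and exports the figure as PNG. The data is synthetic, and the non-interactive "Agg" backend is an assumption made so the example also runs without a display.

```python
import io

import matplotlib
matplotlib.use("Agg")  # assumption: no display available, render off-screen
import matplotlib.pyplot as plt
import numpy as np

x = np.linspace(0, 2 * np.pi, 50)
samples = np.random.default_rng(0).normal(size=200)  # synthetic data

# Two plots side by side: a line chart and a histogram.
fig, (ax_line, ax_hist) = plt.subplots(1, 2, figsize=(8, 3))
ax_line.plot(x, np.sin(x), label="sin(x)")
ax_line.set_title("Line chart")
ax_line.legend()
ax_hist.hist(samples, bins=20)
ax_hist.set_title("Histogram")
fig.tight_layout()

# Export: write the figure to an in-memory PNG.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
png_bytes = buffer.getvalue()
plt.close(fig)

print(len(png_bytes), "bytes of PNG data")
```

Passing a file name such as `"plot.pdf"` or `"plot.svg"` to `fig.savefig` instead of the buffer exports to those formats.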

Core Task: Creating static, animated, and/or interactive visualizations in
Python

How to install Matplotlib: Matplotlib Installation

pip install matplotlib



conda install matplotlib


 

GitHub: Matplotlib
Tutorials: Matplotlib tutorials 

Books for further reading:

 * Mastering matplotlib by Duncan M. McGreggor
 * Interactive Applications Using Matplotlib by Benjamin Root
 * Matplotlib for Python Developers by Sandro Tosi

 


SEABORN

 



 



Seaborn is a library that has been built on top of matplotlib and is closely
integrated with pandas data structures. It provides a high-level interface for
drawing attractive and informative statistical graphs using its plotting
functions to help you further explore and understand your data. 

Seaborn can:

 * Create scatter plots, histograms, bar plots, box-and-whisker plots, and more
 * Show a linear relationship between two or three variables
 * Handle Pandas data frames more comfortably than matplotlib
 * Perform semantic mapping and statistical aggregation to produce informative
   plots.
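
The following sketch illustrates two of the points above on a Pandas data frame: a scatter plot with a fitted linear relationship, and a bar plot that aggregates values per group. The data frame and its column names (`hours`, `score`, `group`) are synthetic and hypothetical, and the "Agg" backend is assumed so the example runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: no display available, render off-screen
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# Synthetic data: score grows roughly linearly with hours, plus noise.
rng = np.random.default_rng(0)
hours = rng.uniform(0, 10, size=50)
df = pd.DataFrame({
    "hours": hours,
    "score": 50 + 4 * hours + rng.normal(0, 3, size=50),
    "group": np.where(hours > 5, "high", "low"),
})

fig, (ax_reg, ax_bar) = plt.subplots(1, 2, figsize=(8, 3))

# Scatter plot with a fitted regression line, referring to columns by name.
sns.regplot(data=df, x="hours", y="score", ax=ax_reg)

# Statistical aggregation: the bar height is the mean score per group.
sns.barplot(data=df, x="group", y="score", ax=ax_bar)

fig.tight_layout()
fig.savefig("seaborn_example.png")
plt.close(fig)
```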

Core Task: Making statistical graphics in Python

How to install Seaborn: Seaborn Installation

pip install seaborn



conda install seaborn


 


SCIKIT-LEARN

 



Source: Wikipedia
 



Scikit-learn is a free machine learning library that contains effective tools
for machine learning and statistical modeling, such as classification,
regression, clustering, and dimensionality reduction.

The main benefits of scikit-learn are that it is open-source, easy to use,
well documented, and versatile.

Scikit-learn can be used in:

 * Supervised learning and Unsupervised learning
 * Clustering and Dimensionality Reduction
 * Ensemble methods
 * Cross-validation
 * Feature extraction and selection
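
As a minimal sketch of how several of these pieces fit together, the example below chains dimensionality reduction and an ensemble classifier in a pipeline and scores it with cross-validation. The dataset (the bundled iris data) and the hyperparameters are arbitrary choices for illustration, not tuned recommendations.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Small labeled dataset that ships with scikit-learn (supervised learning).
X, y = load_iris(return_X_y=True)

# Scale the features, reduce them to 2 components, then classify
# with an ensemble of decision trees.
model = make_pipeline(
    StandardScaler(),
    PCA(n_components=2),
    RandomForestClassifier(n_estimators=50, random_state=0),
)

# 5-fold cross-validation: fit on four folds, score on the held-out fold.
scores = cross_val_score(model, X, y, cv=5)
mean_accuracy = scores.mean()

print("accuracy per fold:", scores)
print("mean accuracy:", mean_accuracy)
```

Putting the preprocessing steps inside the pipeline means they are refit on each training fold, which keeps information from the held-out fold out of the model.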

Core Task: Machine learning and statistical modeling

How to install scikit-learn: scikit-learn Installation

pip install scikit-learn


 

Further reading:

 * Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by
   Aurélien Géron

 


TENSORFLOW

 



Source: Wikipedia
 



TensorFlow was built by the Google Brain team and is an open-source library for
deep learning applications. It makes it easier to build deep learning models by
helping developers create large-scale neural networks with many layers using
data flow graphs.

TensorFlow can be, or has been, used for:

 * Voice and sound recognition
 * Sentiment analysis and text classification
 * Text applications such as Google Translate, Gmail, and more
 * Facial recognition such as Facebook's DeepFace, photo tagging, and more
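
To give a feel for building and training a layered network, here is a deliberately tiny sketch using the Keras API bundled with TensorFlow. The layer sizes, the random training data, and the regression target (the sum of the four input features) are all assumptions made for illustration; real applications like those above use far larger models and datasets.

```python
import numpy as np
import tensorflow as tf

tf.random.set_seed(0)

# A small feed-forward network: 4 inputs, two hidden layers, 1 output.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(4,)),
    tf.keras.layers.Dense(8, activation="relu"),
    tf.keras.layers.Dense(8, activation="relu"),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# Synthetic data: the target is simply the sum of the four features.
rng = np.random.default_rng(0)
features = rng.normal(size=(32, 4)).astype("float32")
targets = features.sum(axis=1, keepdims=True)

# Train briefly, then predict on the first two rows.
history = model.fit(features, targets, epochs=2, batch_size=8, verbose=0)
predictions = model.predict(features[:2], verbose=0)

print(history.history["loss"])
print(predictions)
```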

Core Task: Develop and train models using Python

How to install TensorFlow: TensorFlow Installation

pip install tensorflow


 

Books for further reading:

 * Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by
   Aurélien Géron
 * Learning TensorFlow: A Guide to Building Deep Learning Systems by Itay
   Lieder, Tom Hope, and Yehezkel S. Resheff
 * TensorFlow for Deep Learning: From Linear Regression to Reinforcement
   Learning by Bharath Ramsundar and Reza Bosagh Zadeh

 
 
Nisha Arya is a Data Scientist and Freelance Technical Writer. She is
particularly interested in providing Data Science career advice, tutorials, and
theory-based knowledge around Data Science. She also wishes to explore the
different ways Artificial Intelligence can benefit the longevity of human
life. She is a keen learner, seeking to broaden her tech knowledge and writing
skills whilst helping guide others.
 







© 2022 KDnuggets. | About KDnuggets  | Contact  | Privacy policy  | Terms of
Service
 
Published on April 11, 2022 by Nisha Arya
