Data Processing Tools with NumPy

you own this product
prerequisites
basic Python (Jupyter Notebook, NumPy, RegEx) • basic matrix operations • basic knowledge of trie data structure • basic knowledge of TF-IDF and SVD
skills learned
extract features with NumPy • vectorize text with TF-IDF and SVD • compute similar items with embedded vectors
1 week · 4-6 hours per week · INTERMEDIATE

pro $24.99 per month

  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose one free eBook per month to keep
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime

lite $19.99 per month

  • access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more


Look inside

As a data engineer in an online recruiting tech company or a large organization’s HR department, you’ll build a series of practical tools to process and extract useful information from unstructured text data using NumPy. You’ll learn important methods (including trie data structure, TF-IDF, SVD), how to implement them, and their applications in the real world. When you’re finished, you’ll have the know-how to build data processing tools that meet the needs of machine learning engineers, data analysts, and product developers.

This project is a part of the series DS Pipeline with Python.
This project is designed for learning purposes and is not a complete, production-ready application or solution.

prerequisites

This liveProject is for Python beginners who are interested in building data processing tools using NumPy. To begin these liveProjects you’ll need to be familiar with the following:

TOOLS
  • Basic Python
  • Basic Jupyter Notebook/JupyterLab
  • Basic NumPy and pandas
TECHNIQUES
  • Basic matrix operations
  • Basic knowledge of tree data structure
  • Basic concept of TF-IDF, SVD (what they’re named for and used for)
  • Basic understanding of tokenization and cleaning of text data

features

Self-paced
You choose the schedule and decide how much time to invest as you build your project.
Project roadmap
Each project is divided into several achievable steps.
Get Help
While within the liveProject platform, get help from fellow participants and even more help with paid sessions with our expert mentors.
Compare with others
For each step, compare your deliverable to the solutions by the author and other participants.
book resources
Get full access to select books for 90 days. Permanent access to excerpts from Manning products are also included, as well as references to other resources.
choose your plan

team

monthly
annual
$49.99
$499.99
only $41.67 per month
  • five seats for your team
  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose another free product every time you renew
  • choose twelve free products per year
  • exclusive 50% discount on all purchases
  • renews monthly, pause or cancel renewal anytime
  • renews annually, pause or cancel renewal anytime
  • Data Processing Tools with NumPy project for free