Loading...

Course Description

In this course, you will start to use machine learning methods to further your exploration of document term matrices (DTM). You will use a DTM to create train and test sets with the scikit-learn package in Python — an important first step in categorizing different documents. You will also examine different models, determining how to select the most appropriate model for your particular natural language processing task. Finally, after you have chosen a model, trained it, and tested it, you will work with several evaluation metrics to measure how well your model performed. The technical skills and evaluation processes you study in the course will provide valuable experience for the workplace and beyond.

You are required to have completed the following courses or have equivalent experience before taking this course:

  • Natural Language Processing Fundamentals
  • Transforming Text Into Numeric Vectors

Faculty Author

Dr. Oleg Melnikov

Benefits to the Learner

  • Create train and test sets from document term matrices
  • Train classification models to categorize documents
  • Evaluate the model on the test set to measure how well it generalizes

Target Audience

  • Engineers
  • Software developers
  • Computer scientists new to NLP
  • Data scientists
  • Analysts
  • Researchers
  • Linguists

Applies Towards the Following Certificates

Loading...
Enroll Now - Select a section to enroll in
Type
3 week
Dates
May 22, 2024 to Jun 11, 2024
Total Number of Hours
24.0
Course Fee(s)
Regular Price $1,199.00
Type
3 week
Dates
Jul 31, 2024 to Aug 20, 2024
Total Number of Hours
24.0
Course Fee(s)
Regular Price $1,199.00
Type
3 week
Dates
Oct 09, 2024 to Oct 29, 2024
Total Number of Hours
24.0
Course Fee(s)
Regular Price $1,199.00
Type
3 week
Dates
Dec 18, 2024 to Jan 07, 2025
Total Number of Hours
24.0
Course Fee(s)
Regular Price $1,199.00
Required fields are indicated by .