This three-part workshop
series covers the basics of text mining with Python. The series focuses
primarily on unstructured text data, discussing how to format and clean text to
enable the discovery of significant patterns in collections of documents.
Sessions will introduce participants to core terminology in text mining/natural
language processing and will walk through different methods of ranking terms
and documents. It concludes by using these methods to classify texts and to
build models of “topics.” Basic familiarity with Python is required.
Workshop dates were
February 14, February 16, and February 18, 2022.
The copyright on this video is owned by the
Regents of the University of California and is licensed for reuse under the
Creative Commons Attribution 4.0 International (CC BY 4.0) License.