BlogPost5 Natural Language Processing (NLP) using libraries from the Hugging Face ecosystem
Published:
Motivation
It has been a few years that I haven’t seriously code an NLP product. It turned out that I am a bit outdated. However, I am planning to come back in 2023. This a side note of my revision about NLP using Hugging Face platform. I recall that Hugging Face was just a github page providing BERT in pytorch in 2019. Now, it has grown into a giant ecosystem for NLP. Time really flies.
There will be jupyter notebook associating to each chapter
My note and flow of understanding/thinking about the course
Table of contents
A course from Hugging Face Hub.
Setup and Introduction
Although I am already experienced with NLP It is worth taking a short recap of the theory a little bit.
- NLP is a field of study which uses machine learning to understand human related language. Not only single words individually but to be able to understand the context of those words.
Common NLP tasks including:
Classifying whole sentences: i.e. detect spam email
Classifying each word in a sentence: the grammatical component of a sentence. of named entities (person, location, organization)
Extracting an answer from a text (a.k.a question answering task): given a question and a context, extracting the answer to the question based on the information provided.
Generating a new sentence from an input text: Translating a text into new language, summarizing a text