Lecture 5 - Part b
Natural Language Processing
High-level understanding of the language spoken and written by humans
Also, generation (e.g., ChatGPT)
An enabler for technology like Siri or Alexa
Digital, or digitised
Computer using natural language as input and/or output
Human languages are messy, ambiguous, and ever-changing
A string may have many possible interpretations at every level
The correct resolution of the ambiguity will depend on the intended meaning, which is often inferable from the context
There is tremendous diversity in human languages
Languages express the same kind of meaning in different ways
Some languages express some meanings more readily/often
Knowledge Bottleneck
About language
About the world: Common sense and Reasoning
Christopher Robin is alive and well. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchford Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book
Who wrote Winnie the Pooh?
Where did Chris live?
The presence of two or more possible meanings within a single word
The presence of two or more possible meanings within a single sentence or sequence of words
The policeman shot the thief with the gun
Every fifteen minutes a woman in this country gives birth. Our job is to find this woman, and stop her!
Groucho Marx
“... given some document collection, the frequency of any word is inversely proportional to its rank in the frequency table...”
LOL | Laugh out loud |
G2G | Got to go |
BFN | Bye for now |
B4N | Bye for now |
Idk | I don't know |
FWIW | For what it's worth |
LUWAMH | Love you with all my heart |
It uses dictionaries and morphological analysis of words to return the base or dictionary form of a word
Example: Lemmatization of saw —> attempts to return see or saw depending on whether the use of the token is a verb or a noun
Tagging each word in a sentence with a corresponding part-of-speech (e.g. noun, verb, adverbs)
The detection of attitudes
"enduring, affectively colored beliefs, dispositions towards objects or persons”ML4D Course Description
ML4D Course Description
Disambiguation depends on context!
https://textsummarization.net/
https://brevi.app/single-demo (not working!)
Credits: Nava Tintarev
Lecture 5 - Part b
Natural Language Processing
CIS 419/519 Applied Machine Learning. Eric Eaton, Dinesh Jayaraman. https://www.seas.upenn.edu/~cis519/spring2020/
EECS498: Conversational AI. Kevin Leach. https://dijkstra.eecs.umich.edu/eecs498/
CS 4650/7650: Natural Language Processing. Diyi Yang. https://www.cc.gatech.edu/classes/AY2020/cs7650_spring/
Natural Language Processing. Alan W Black and David Mortensen. http://demo.clab.cs.cmu.edu/NLP/
IN4325 Information Retrieval. Jie Yang.
Speech and Language Processing, An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Third Edition. Daniel Jurafsky, James H. Martin.
Natural Language Processing, Jacob Eisenstein, 2018.