Topic 7: Intro to text and speech analytics Flashcards
NLP
this is about what goes into getting computers to perform useful and interesting task involving human languages
concerned with insights that such computational work gives us into human processing of language
Why NLP?
- enourmous amount of knowledge is available in machine readable form as natural language text
- conversational agents are becoming important form of human-computer interaction
- much human-human communication is now mediated by computers
Application of NLP
- Google translate
- web Q/A
- weblog analytics (product marketing info, political opinion tracking, social network analysis, buzz analysis)
- web analytics ( cambridge analytica mine data to analyse the strategic communication during electoral processes.
Major topics
- Words
- Syntax
- Meaning
what makes application a language processing application?
- requires the use of knowledge about human language
word count yes
count line / bytes no
Big application of NLP
- Q/A
- conversational agents
- summarization
- machine translation
what is needed for NLP?
- speech recognition and synthesis
- knowledge of the english words
- how group of words clump
- dialog
knowledge required in HAL
- determine what user is saying. (phonetics and phonology)
- recognizing variation of words (morphology)
- understand syntax / structure of sentence
- semantics meaning understanding
- implicit usage of languages . pragmatics
- discourse. managing larger linguistic units
why NLP is considered part AI solution?
- dealing with ill-defined problems
- don’t have exact solutions/algorithm
aim of conversational agents is to have spontaneous dialogue
categories of knowledge in NLP
phonology morphology syntax semantics pragmatics discourse
why NLP difficult?
- non-standard english
- segmentation issues
- idioms
- neologisms (newly coined words/expression)
- multilingual
- tricky entity names
ambiguity
computational linguist are obsessed with ambiguity.
fundamental problem
resolving ambiguity is an important goal
example
i made her duck
POS
Word sense disambiguation
Syntactic disambiguation
Text and speech application
small (standalone)
- spelling correction
- hyphenation
medium (enabler app)
- WSD
- name entity recognition
- information retrieval
large (funding/business plans)
- Q/A
- conversational agent
- machine translation
- speech-to-speech translation
- context to speech system