Copyright © 2018, 2014, 2011 Pearson Education, Inc. All Rights Reserved

Natural Language Processing (NLP) (1 of 4)
• Structuring a collection of text
  – Old approach: bag-of-words
  – New approach: natural language processing
• NLP is …
  – a very important concept in text mining
  – a subfield of artificial intelligence and computational linguistics
  – the study of "understanding" natural human language
• Syntax-based versus semantics-based text mining
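To make the bag-of-words approach concrete, here is a minimal sketch in Python. It assumes a very simple tokenizer (lowercase letters and apostrophes only), and the function name `bag_of_words` is chosen for illustration; it is not taken from the slides. The point is that a bag keeps word counts but discards word order, which is why the two sentences below become indistinguishable.

```python
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text, extract word tokens, and count each token."""
    # Simplifying assumption: a token is any run of letters or apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)


a = bag_of_words("The dog bit the man.")
b = bag_of_words("The man bit the dog.")

print(a)       # word counts for the first sentence
print(a == b)  # True: both sentences map to the same bag
```

The two sentences mean opposite things, yet their bags are equal. This is the gap that syntax-aware and semantics-aware NLP methods try to close.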
Natural Language Processing (NLP) (2 of 4)
• What is "Understanding"?
  – Humans understand; what about computers?
  – Natural language is vague and context driven
  – True understanding requires extensive knowledge of a topic
  – Can/will computers ever understand natural language as accurately as we do?
Natural Language Processing (NLP) (3 of 4)
• Challenges in NLP
  – Part-of-speech tagging
  – Text segmentation
  – Word sense disambiguation
  – Syntax ambiguity
  – Imperfect or irregular input
  – Speech acts
• Dream of AI community
  – to have algorithms that are capable of automatically reading and obtaining knowledge from text
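As an illustration of one challenge listed above, word sense disambiguation, the sketch below picks the sense whose definition shares the most content words with the surrounding sentence. This is a simplified variant of the Lesk overlap idea, not a method described in the slides. The sense inventory `SENSES`, the stopword list, and the function names are hypothetical and written by hand for this example.

```python
import re

# Hypothetical, hand-written sense inventory for illustration only.
SENSES = {
    "bank": {
        "financial institution": "an institution that accepts deposits and lends money",
        "river edge": "sloping land beside a river or lake",
    }
}

# Small, assumed stopword list; a real system would use a fuller one.
STOPWORDS = {"a", "an", "the", "of", "or", "and", "that", "to", "at", "on", "in", "by", "i", "we"}


def content_words(text):
    """Return the set of lowercase words in text, minus stopwords."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def disambiguate(word, sentence):
    """Pick the sense whose gloss overlaps most with the sentence's words."""
    context = content_words(sentence)
    best_sense, best_overlap = None, -1
    for sense, gloss in SENSES[word].items():
        overlap = len(context & content_words(gloss))
        if overlap > best_overlap:  # on a tie, the earlier sense is kept
            best_sense, best_overlap = sense, overlap
    return best_sense


print(disambiguate("bank", "She deposits money at the bank every Friday"))
# financial institution
print(disambiguate("bank", "We sat on the bank of the river"))
# river edge
```

The approach fails whenever the sentence shares no words with any definition, which shows why natural language being vague and context driven makes this problem hard.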
Natural Language Processing (NLP) (4 of 4)
• WordNet
  – A laboriously hand-coded database of English words, their definitions, sets of synonyms, and various semantic relations between synonym sets
  – A major resource for NLP
  – Needs automation to be completed
• Sentiment Analysis
  – A technique used to detect favorable and unfavorable opinions toward specific products and services
  – SentiWordNet
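A lexicon-based take on sentiment analysis can be sketched as follows. The polarity scores in `LEXICON` are made up for this example and are not taken from SentiWordNet; the function name `sentiment` and the three output labels are likewise assumptions. The sketch sums the scores of known words and reports whether the total is positive, negative, or zero.

```python
import re

# Hypothetical polarity scores for illustration; NOT values from SentiWordNet.
LEXICON = {
    "great": 1.0,
    "love": 1.0,
    "good": 0.5,
    "slow": -0.5,
    "poor": -1.0,
    "terrible": -1.0,
}


def sentiment(text):
    """Label text by the sum of the lexicon scores of its words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    score = sum(LEXICON.get(token, 0.0) for token in tokens)
    if score > 0:
        return "favorable"
    if score < 0:
        return "unfavorable"
    return "neutral"


print(sentiment("I love this phone, the camera is great"))  # favorable
print(sentiment("Terrible battery and poor support"))       # unfavorable
print(sentiment("The box arrived on Tuesday"))              # neutral
```

Because the sketch ignores word order, it would score "not good" as favorable. Handling negation and context is where syntax-based and semantics-based processing becomes necessary.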
Application Case 5.2 (1 of 2)
AMC Networks Is Using Analytics to Capture New Viewers, Predict Ratings, and Add Value for Advertisers in a Multichannel World
A Web-Based Dashboard Used by AMC Networks [Source: AMC Networks]