Three common tasks in computational semantics:
obtaining structured information from language data
RE systems typically target a particular domain, e.g.:
"Gryffindor values courage, bravery, nerve, and chivalry. Gryffindor's mascot is the lion, and its colours are scarlet and gold. The Head of this house is the Transfiguration teacher and Deputy Headmistress, Minerva McGonagall until she becomes headmistress, and the house ghost is Sir Nicholas de Mimsy-Porpington, more commonly known as Nearly Headless Nick. According to Rowling, Gryffindor corresponds roughly to the element of fire. The founder of the house is Godric Gryffindor."
Templates:
If parsers/NER-taggers/Chunkers are available, templates can refer to their output:
or
Use parsed and annotated text to train text classifiers
E.g. decide for each pair of named entities (PERSON and ORGANIZATION) whether they are in the "ceo_of" relationship, based on context features
Features typically include:
When little or no training data is available, we must use what we have to generalize:
seed tuple: author(William_Shakespeare, Hamlet)
found instances:
extracted patterns:
Finally, use these patterns to find new seeds
Also called opinion mining - the task of extracting opinions, emotions, attitudes from user-generated text, e.g. about products, movies, or politics
Use training data, extract standard features such as:
Use these to train standard classifiers, e.g. Naive Bayes, SVM, MaxEnt, etc.
One of the oldest and most popular tasks in AI
Recent products include Apple Siri, Amazon Alexa, or IBM's Watson
They’re the two states you could be reentering if you’re crossing Florida’s northern border
Supervised learning can be used to train a classifier on annotated data. See also Li & Roth 2002
From a slide by Mihai Surdenau
Some features for learning to rank:
Source: Jurafsky-Manning slides
Approaches to ranking can be:
see Agarwal et al. 2012 for a short survey of algorithms
Whose granddaughter starred in E.T.?
(acted-in ?x “E.T.”)
(granddaughter-of ?x ?y)