Skip to content

CSCI 544 with Ron Artstein - Fall 2020 at USC Master's program

Notifications You must be signed in to change notification settings

IsmaelV/NLP-Fall-2020

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CSCI 544 - Applied Natural Language Processing

Fall 2020

Student: Ismael Villegas-Molina

Instructor: Ron Artstein

This is the work written by Ismael Villegas-Molina for the Applied NLP course taught by Ron Artstein.

Homework 1: Full Name Predictor

Given a list of training data (with the input as two names and the output as the target full name), predict the full name of the first person in the input.

Homework 2: Lookup Lemmatizer

Very simple lemmatizer, which learns a lemmatization function from an annotated corpus.

Homework 3: Naive Bayes Classifier

Perform a Naive Bayes classification to identify hotel reviews as either truthful or deceptive, and either positive or negative. Use the word tokens as features for classification.

Homework 4: Perceptron Classifier

Perform a perceptron classification (vanilla and averaged) to identify hotel reviews as either truthful or deceptive, and either positive or negative. Uses the word tokens as features, or any other features devised from the text.

Homework 5: Hidden Markov Model Part-of-Speech Tagger

A Hidden Markov Model part-of-speech tagger for Italian and Japanese. The training data is provided tokenized and tagged; the test data will be provided tokenized, and the tagger will add the tags.

About

CSCI 544 with Ron Artstein - Fall 2020 at USC Master's program

Topics

Resources

Stars

Watchers

Forks

Languages