Skip to content

Natural Language Processing (classification and machine translation) codes and analysis done for the year long practicum in Dublin City University (2019-20)

Notifications You must be signed in to change notification settings

MrRaghav/dcu_practicum_2019-20

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 

Repository files navigation

This repository is dedicated to the research done during the M.Sc. in Computing from School of Computing, Dublin City University (Ireland) in 2019-2020.

Area of Research:

Natural Language Processing - text classification and machine translation

Tools, Technologies and Algorithms:

Data collection, annotation

Data processing - stopwords, special characters, numbers, lemmatisation, tokenization

Similarity network - dice coefficient and word2vec

graph based classifiers - min cut max flow, random walk

fasttext classifier

neural network classifiers - ANN, CNN, Bi-LSTM

basic classifiers - Naive Bayes, Logistic regression, decision tree, support vector machine, random forest

machine translation - Amazon sockeye

Metrics used:

Precision, recall, F1 score, AUC, cohen kappa score, BLEU score

Supervisor:

Prof. Andy Way (Dublin City University), Ex-President : European Association for Machine Translation (EAMT) & International Association for Machine Translation (IAMT)

https://www.computing.dcu.ie/~away/

Co-Supervisor:

Dr. Rejwanul Haque

https://www.computing.dcu.ie/~rhaque/students.html

Copyright - Raghvendra Pratap Singh, 2020