NLP:AssignmentTwo Sentimentclassificationforsocialmedia
Buget £20-250 GBP
Job Description:
Topic: Building a sentiment classifier for Twitter
Semeval competitions involve addressing different challenges pertaining to the extraction of meaning
from text (semantics). The organisers of those competitions provide a dataset and a task, so that
different participants can develop their system. In this exercise, we will focus on the Task 4 of Semeval
2017 ([login to view URL]). We will focus particularly on Subtask A, i.e. classifying
the overall sentiment of a tweet as positive, negative or neutral.
As part of the classification task, you will need to preprocess the tweets. You may
want to tweak your preprocessing code to deal with particularities of tweets, e.g. #hashtags or @user
mentions.
Exercise guidelines
• Data: The training, development and test sets can be downloaded from the module website
([login to view URL]). This compressed archive includes 5 files, one that is used for training
([login to view URL]) another one for development ([login to view URL]) and another 3
that are used as different subsets for testing (twitter-test[1-3].txt). You may use the
development set as the test set while you are developing your classifier, so that you tweak your
classifiers and features; the development set can also be useful to compute hyperparameters,
where needed. The files are formatted as TSV (tab-separated-values), with one tweet per row
that includes the following values:
tweet-id<tab>sentiment<tab>tweet-text
where sentiment is one of {positive, negative, neutral}. The tweet IDs will be used as unique
identifiers to build a Python dictionary with the predictions of your classifiers, e.g.:
predictions = {‘163361196206957578’: ‘positive’,
‘768006053969268950’: ‘negative’,
…}
• Classifier: You are requested to develop classifiers that learn from the training data and test
on each of the 3 test sets separately (i.e. evaluating on 3 different sets). You are given the
skeleton of the code ([login to view URL]), with evaluation script included, which will
help you develop your system in a way that we will then be able to run on our computers.
Evaluation on different tests allows you to generalise your results. You may achieve an
2
improvement over a particular test set just by chance (e.g. overfitting), but improvement over
multiple test sets makes it more likely to be a significant improvement.
You should develop at least 3 different classifiers, which you will then present and compare in
your report. Please develop at least 2 classifiers based on (1) traditional machine learning
methods such as MaxEnt, SVM or Naïve Bayes trained on different sets of features (you could
use Scikit-learn library). Then, train an another classifier based on the LSTM using PyTorch
(and optionally the torchtext library) by following the steps below:
a) Download the GloVe word embeddings and map each word in the dataset into its pretrained GloVe word embedding.
First go to [login to view URL] and download the pre-trained
embeddings from 2014 English Wikipedia into the "data" directory. It's a 822MB zip file
named [login to view URL], containing 100-dimensional embedding vectors for 400,000
words (or non-word tokens). Un-zip it. Parse the un-zipped file (it's a txt file) to build an
index mapping words (as strings) to their vector representation (as number vectors).
Build an embedding matrix that will be loaded into an Embedding layer later. It must be a
matrix of shape (max_words, embedding_dim), where each entry i contains the
embedding_dim-dimensional vector for the word of index i in our reference word index
(built during tokenization). Note that the index 0 is not supposed to stand for any word or
token -- it's a placeholder.
40 freelanceri licitează în medie 176£ pentru acest proiect
Top 1% in Freelancer.com Hi, Greetings! ✅checked your project details: ✅Completed Time: In project deadline We have worked on 850 + Projects. I have 6 + years of the experience in same kind of projects. If you Mai multe
Hi, I hope you are doing fine. I have almost 10 years of experience in machine learning algorithms. I can implement various types of artificial intelligence algorithms including yours with Matlab, Python and etc. I hav Mai multe
Hello, I am Python developer, and I can help you with your assignment two sentiment classification for social media.
Hi , I am sure that I can do this job. I'm artificial intelligence engineer experienced in NLP, Machine/Deep learning using Python programming. Feel free to contact me for further details because I am looking forward Mai multe
I am a machine learning expert with 4 years of experience in freelancing. I have successfully delivered 110 projects in the past. My expertise includes deep learning, machine learning, natural language processing, com Mai multe
Hi, Dear Employer; I am Al.A from Korea. I have read your job post carefully. I can write clean, validated Python and Machine Learning code and make a device-supported M. File. I have over seven-plus years of experien Mai multe
Hello Snowboy99, We would like to grab this opportunity and will work till you get 100% satisfied with our work. Please come over chat and discuss your requirement in a detailed way. Regards
Hi there, How are you? I can work with your project requirements mentioned. I can definitely pull it off once provided with detailed file of your requirements. Some points are needed to be discussed before starting so Mai multe
Hi I am NLP scientist, I have worked with a lot of sentiment classifiers using svm naive bayes and even transformer based models. I have read your exercise description and seems to be a straight forward model training Mai multe
MACHINE LEARNING Expert...... Hello dear, Give me a couple of minutes please MACHINE LEARNING Engineer experience over 4 years I can start right now I love machine learning tasks So I am confident from my ability to Mai multe
Hello, I hope this message finds you well, Thanks for posting such an interesting project. I'm the exact type of contractor you are searching for. Having worked on similar projects for the past 10 years, I can handle Mai multe
Implementing projects using Python and Machine Learning is our core forte since we are working on it for more than 5+ years now. We are a team of 50+ developers who have successfully delivered 350+ Machine Learning Pro Mai multe
Hi I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp Mai multe
Hey! Speaking of this project NLP:Assignment Two Sentiment classification for social media, I am delighted to work on it. Being the most reputed freelance writer on this website, with over a decade of experience and ma Mai multe
✦◼️✦ "Full-5 Stars Rating✦◼️✦ - ✦100% Marks Guaranteed!✦ I have 9+ years of experience in managing academic writing jobs successfully. I have done my Master's in English Literature, but I have worked with clients co Mai multe
Hi there, How r u? I have had a look and i am sure that i can handle this project well as i have experience in Algorithm, Java, Data Mining, Python and Machine Learning (ML). I have worked on similar projects before to Mai multe
Hi sir/madam. I just completed reading all requirements & description carefully, I have 5years+ Experience in Python, Machine learning, Data structures, Data Mining, Java. If you hire me i will give you free revisions Mai multe
Hello I am a professional python developer. My main specializations are automation, web scrapers and bots development. I have already developed over 200 scrapers. From the simplest (for example, a competitor's price Mai multe
I have read and understood all your project details "NLP:Assignment Two Sentiment classification for social media" and I feel my self the best candidate to complete this project with 100 percent accuracy. I have vast Mai multe
My name is "Usama Safdar" and I am a Ph.D degree holder which means I am highly-capable to tackle this project "NLP:Assignment Two Sentiment classification for social media" with 100 percent accuracy. I am a professi Mai multe