Web-crawling & scraping of Interviews with film industry professionals

Closed. Posted on May 5, 2016. Paid on delivery.

This project involves locating interviews with particular film industry professionals (directors, producers and actors/actresses) from a defined list of websites/magazines/newspapers, scraping the text of each interview and storing it in a separate text file (using the following naming convention: [personID][interview number (001-xxx)].txt).

At the same time, you will keep track of the interviews you found in a master list (columns: interview number, URL where the interview is located, and date of the interview, if available). You are asked to collect a minimum of 10 interviews per person available from a shortlist of sources.
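The file naming convention and the master list could be handled along the following lines. This is only a minimal sketch: the helper names, the `master_list.csv` filename, the CSV format, and the three-digit zero padding (read from "001-xxx") are assumptions, not part of the brief.

```python
import csv
from pathlib import Path

# Column names mirror the master list described above; the exact labels are an assumption.
MASTER_COLUMNS = ["interview_number", "url", "date"]


def interview_filename(person_id: str, interview_number: int) -> str:
    """Build '[personID][interview number].txt' with a zero-padded number."""
    if interview_number < 1:
        raise ValueError("interview numbers start at 001")
    return f"{person_id}{interview_number:03d}.txt"


def save_interview(out_dir, person_id, interview_number, url, text, date=None):
    """Write the interview text to its own file and append a row to the master list."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    text_path = out_dir / interview_filename(person_id, interview_number)
    text_path.write_text(text, encoding="utf-8")

    # Hypothetical master list location; one CSV per output directory.
    master_path = out_dir / "master_list.csv"
    is_new = not master_path.exists()
    with master_path.open("a", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        if is_new:
            writer.writerow(MASTER_COLUMNS)
        # The date is optional because it is not always available.
        writer.writerow([f"{interview_number:03d}", url, date or ""])
    return text_path
```

Because the interview number alone does not identify the person, it may be worth keeping one output directory (and so one master list) per person, or adding a person ID column if the client agrees.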

The project will consist of the following steps for each list of persons:

1. Determine method of access to the intended data sources (websites/magazines/newspapers), for which we will provide a list of 10.

2. Query these sources for the persons on the list, check whether each interview contains evidence of the interviewee being quoted (quotation marks in combination with prose, name in combination with a verb indicative of speech), and scrape the interview if it meets this criterion. Save the speech parts of the interview in a secondary file.

3. Supplement where necessary with top hits in a Google search (person's name + interview), determine which of these are from sources not included in the list used for Step 1, and execute Step 2 on the additional sources found until a sufficient number of interviews per person is reached.
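The quotation check in Step 2 and the source filter in Step 3 might be sketched as follows. Everything here is an assumption for illustration: the list of speech verbs, the three-word minimum for a quote, treating either signal as sufficient evidence, and comparing sources by hostname. A real run would need tuning per source.

```python
import re
from urllib.parse import urlparse

# Hypothetical, non-exhaustive list of verbs indicative of speech.
SPEECH_VERBS = ("said", "says", "told", "explained", "explains",
                "recalled", "recalls", "added", "adds")

# Text enclosed in straight or curly double quotes.
QUOTE_RE = re.compile(r'["\u201c]([^"\u201c\u201d]+)["\u201d]')


def extract_quotes(text, min_words=3):
    """Return quoted passages of at least `min_words` words (the 'speech parts')."""
    quotes = (m.group(1).strip() for m in QUOTE_RE.finditer(text))
    return [q for q in quotes if len(q.split()) >= min_words]


def name_with_speech_verb(text, name):
    """True if the surname appears next to a speech verb, e.g. 'Doe said'."""
    surname = re.escape(name.split()[-1])
    verbs = "|".join(SPEECH_VERBS)
    pattern = re.compile(
        rf"\b{surname}\s+(?:{verbs})\b|\b(?:{verbs})\s+(?:\w+\s+)?{surname}\b",
        re.IGNORECASE,
    )
    return bool(pattern.search(text))


def is_quoted_interview(text, name):
    """Assumed criterion: either signal counts as evidence of being quoted."""
    return bool(extract_quotes(text)) or name_with_speech_verb(text, name)


def domain_of(url):
    """Lower-cased hostname without a leading 'www.'."""
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def new_source_hits(hit_urls, listed_domains):
    """Keep search hits whose domain is not among the sources used in Step 1."""
    listed = {d.lower() for d in listed_domains}
    return [u for u in hit_urls if domain_of(u) not in listed]
```

The output of `extract_quotes` is what would go into the secondary file, one quote per line for example. Regex heuristics like these will miss single-quoted speech and Q&A-style transcripts, so a manual spot check per source is advisable.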

Lists include:

1. Directors: 306 (overlaps with producers for 110 professionals)

2. Producers: 418 (overlaps with directors for 110 professionals)

3. Actors/actresses: 697
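For a rough sense of scale, the list sizes above can be combined as follows. This is a back-of-the-envelope sketch that assumes the actors/actresses list does not overlap with the other two lists (the brief only states the director/producer overlap) and that the 10-interview minimum applies to every person.

```python
directors = 306
producers = 418
director_producer_overlap = 110  # counted in both lists above
actors = 697
min_interviews_per_person = 10

# Inclusion-exclusion: count the 110 shared professionals only once.
unique_directors_producers = directors + producers - director_producer_overlap

# Assumption: no actor or actress also appears as a director or producer.
unique_persons = unique_directors_producers + actors

min_interviews = unique_persons * min_interviews_per_person

print(unique_directors_producers)  # 614
print(unique_persons)              # 1311
print(min_interviews)              # 13110
```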

The person taking this job has experience with web crawlers and text scraping, can work with a wide range of source material for text scraping, has a proactive attitude and is a creative problem-solver.

If this is you, we look forward to your application.

Data Entry, Data Mining, Data Processing, Web Scraping, Web Search

Project ID: #10421921

About the project

18 proposals. Remote project. Active 7 years ago.

18 freelancers are bidding an average of $466 for this project

Marie1234

A proposal has not yet been provided

$315 CAD in 1 day
(275 reviews)
7.5
diamond247

We are a team (19 operators) here, giving all data entry, research and scraping service world wide with best quality output, gone through your project description, we are experienced enough to collect the data from sev...

$400 CAD in 10 days
(255 reviews)
7.2
Verz1Lka

Hello! I'm web scraping expert and i can done your project. I use python language and scrapy framework. My scripts works on windows, mac or linux, but linux is preferably. I can schedule scripts on server if it is req...

$399 CAD in 10 days
(107 reviews)
6.5
seoguru17

Wide experience in Research and Scraping. I have gone through from your job posting and i would like to discuss few questions that i have

$333 CAD in 10 days
(95 reviews)
6.3
vlayausa

Hello, I am interested in this project. I am looking forward to working on it, because it is connected with film industry, and I love watching movies and I know a lot about actors, directors and other things...

$300 CAD in 7 days
(93 reviews)
5.5
sylar1015

hello, sir: c/c++/python expert worked for samsung & huawei maybe more details will be helpful a sample can be provided before hired. hope to get message from u ty

$400 CAD in 10 days
(16 reviews)
4.7
jinigo23

Hi there! I've good knowledge and experience in this kind of work. I will complete your project within the time stated. I am ready to start the work immediately. I will do my best. I've added some of my previous projec...

$340 CAD in 10 days
(10 reviews)
5.0
Elsa22

Hello, I am a freelancer from Sweden. I have done similar projects on data entry and Web scrapping. Moreover I have got good feedbacks too. So if you are willing to hire me I will assure to do a quality work as I have...

$500 CAD in 10 days
(13 reviews)
4.4
dghq123

Hello, I am computer science grad and had done many scrapping projects. let me know the URL. i can start now.

$555 CAD in 10 days
(4 reviews)
4.0
mike199

My name is Mike and I’m from UK. I work with individual clients and also provide outsourcing services for a number of UK and USA based agencies. Your project description sounds interesting to me and I do have skills &...

$555 CAD in 10 days
(0 reviews)
0.0
SharePointExper

Hey, this seems like a very interesting project. I can help you with this with a powerful tool which I've designed for these types of projects. Specifically to extract the text from a source and send it to a second fil...

$600 CAD in 4 days
(0 reviews)
0.0
Shopify

I want to discuss this project with you further, let me know the best suitable time for you to schedule the meeting, Feel free to message me at any time, i used to be online 14 hrs in a day on this website so probably...

$773 CAD in 20 days
(0 reviews)
3.4
ashanfw

Hi, I'm an IT guy by trade and handle most aspects of IT in general from data entry to network and server administration. I would like to work for myself and on my own terms and shall use my ever growing profession...

$388 CAD in 10 days
(0 reviews)
0.0