Find Jobs
Hire Freelancers

Wikipedia data dump miner

$15-25 USD / hour

Închis
Data postării: aproape 7 ani în urmă

$15-25 USD / hour

I 'm looking for wikipedia and machine learning expert. - Are you an expert Wikipedia dump files? - Do you love to write scripts that automate extractions? - Which scripting languages do you already know? Python, Bash? - Work closely with our teams building user experiences and collaborative machine learning algorithms. What do you think of this fist task 1. given two languages , say en and zh. 2. and a page category , like Living people. 3. and a specified WP dump date. 4. generate a set of sets of name string. where each set has all of the en redirects and zh redirects for a given pair of en-zh linked titles. For example, the set for Vladimr Putin's page would have all his redirects in English as well as his page name in Chinese and all of its redirects. If you like that as a starting task, please give me an hour estimate for it and we can start a contract with that as the first task going forward we have a bunch of tasks of this kind.
ID-ul proiectului: 14845502

Despre proiect

10 propuneri
Proiect la distanță
Activ: 7 ani în urmă

Vrei să câștigi bani?

Avantajele de a licita pe platforma Freelancer

Stabilește bugetul și intervalul temporal
Îți primești plata pentru serviciile prestate
Evidențiază-ți propunerea
Te înregistrezi și licitezi gratuit pentru proiecte
10 freelanceri plasează o ofertă medie de $21 USD/oră pentru proiect
Avatarul utilizatorului
I can write a script using Python's request library that will generate a set of sets of name string, based on your specified criteria. The request library is really powerful and allows features such as persistent sessions (for fast querying). I can complete the initial task in 6 hours. If you are interested, we can talk via chat and I can tell you more about my previous (similar) work!
$25 USD în 20 zile
5,0 (8 recenzii)
4,5
4,5
Avatarul utilizatorului
i would like to offer you my expertiseas I have done number of my academic projects and I am a professional in the field Contact me and I’ll show you what i am capable of
$22 USD în 40 zile
4,0 (18 recenzii)
3,1
3,1
Avatarul utilizatorului
I'm CTO at datascraping [dot] club, we provide data scraping and websites scrapping services, have a lot of experience with machine learning and data scrapping in general. Would love to chat about your project and share my experience. Thanks
$22 USD în 40 zile
5,0 (1 recenzie)
2,2
2,2
Avatarul utilizatorului
Hello, my name is Michael. I represent Ukrainian based IT-company Webbook Inc that provides services in the IT-sphere for international business. We were carefully reviewing the requirements of the job description, so our devs can work on Your project without delay. We have years of working on projects related on any available CMS, from "scratch" with core php and php-frameworks(Yii/Yii2, Laravel, CodeIgniter), JavaScript, jQuery, AJAX, HTML5, CSS3, Bootstrap, javascript-frameworks, 3d desidg, graphic design etc. However, I shall discuss about the requirements and functionalities in details to have a better understanding about time frame and price. We are glad to chat with You and discuss all in details. Contact us and we will reply immediately. Waiting for Your reply! Best regards, Webbook team
$22 USD în 40 zile
0,0 (0 recenzii)
0,0
0,0
Avatarul utilizatorului
I have hands on expertise in python ( beautiful soup ) web crawling, I am also a data engineer where day job involves creating data pipelines for extraction, transformations.
$27 USD în 30 zile
0,0 (0 recenzii)
0,0
0,0
Avatarul utilizatorului
I have been editing Wikipedia more then 3 years, also I have use Pywikibot with my own scripts. Beside that, I love everything related to Wikipedia and I will do this job with love :).
$15 USD în 40 zile
0,0 (0 recenzii)
0,0
0,0

Despre client

Steagul CHINA
Beijing, China
5,0
1
Membru din apr. 7, 2017

Verificarea clientului

Mulțumim! Ți-am trimis prin e-mail linkul pe care trebuie să-l accesezi pentru a revendica creditul gratuit.
A apărut o eroare la trimiterea e-mailului. Încearcă din nou.
Utilizatori înregistrați Totalul proiectelor postate
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Se încarcă previzualizarea
S-a oferit permisiunea de depistare a locației.
Ți-a expirat sesiunea pentru conectare sau te-ai deconectat. Conectează-te din nou.