Find Jobs
Hire Freelancers

Build me a general domain Polish-English termbase

$30-250 USD

Închis
Data postării: aproape 5 ani în urmă

$30-250 USD

Plata la predare
PROJECT GOAL The goal of the project is to create a bilingual (Polish & English) termbase with minimum 100,000 entries (general domain), which would be uploaded into MemSource CAT software and used during translations. This database needs to contain two sets of columns a) Polish term b) English term. No characters nor operators are allowed - just the actual terms. The difficulty here is finding and processing the input information. INPUTS Possible readymade databases like: - online dictionaries - dictionaries on USB stick, CD OUTPUT FILE STRUCTURE The basic structure of the output file needs to be this: column A: Polish term column B: English term column C-X: second and next (if available) English term EXPECTED PROPOSAL 1. How to find the right, high quality data. You also have an idea and technical skill to obtain that data in required quantity and quality. 2. Technology you would use. 3. Describe the process you would follow. 4. Describe expected outcomes. IDEAL CANDIDATE: 1. Self-reliant, self-starter 2. Highly experienced in data analytics and/or data science 3. Great mathematical problem solver capable of building complex algorithms ACCEPTANCE CRITERIA 1. Any row needs to consist of at least one Polish term and a corresponding, at least one English term. 2. There is a minimum of 100,000 VALID entries (Polish term & English term is 1 entry). 3. Source of data is revealed to me so I can do a quality check and approve it. 4. Polish and English terms are clearly separated with an operator or placed in different column. 5. Subsequent english terms are divided by a comma “,” OR and a semicolon “;” OR are placed in different columns of the same row. 6. Abbreviations like “mat.” or “chem.” denote subject matter areas and are redundant - should be ignored and excluded from output file. 7. No entries shorter than 3 letters (1 and 2 letters long). 8. No blank rows between rows. 9. Spot-check your work - I will spot-check random 1000 entries to ensure proper quality and structure.
ID-ul proiectului: 19819497

Despre proiect

Proiect la distanță
Activ: 5 ani în urmă

Vrei să câștigi bani?

Avantajele de a licita pe platforma Freelancer

Stabilește bugetul și intervalul temporal
Îți primești plata pentru serviciile prestate
Evidențiază-ți propunerea
Te înregistrezi și licitezi gratuit pentru proiecte

Despre client

Steagul POLAND
Poland
0,0
0
Metoda de plată a fost confirmată
Membru din iun. 24, 2015

Verificarea clientului

Mulțumim! Ți-am trimis prin e-mail linkul pe care trebuie să-l accesezi pentru a revendica creditul gratuit.
A apărut o eroare la trimiterea e-mailului. Încearcă din nou.
Utilizatori înregistrați Totalul proiectelor postate
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Se încarcă previzualizarea
S-a oferit permisiunea de depistare a locației.
Ți-a expirat sesiunea pentru conectare sau te-ai deconectat. Conectează-te din nou.