
Update an existing web scraping tool that gathers data for a startups database from known sites

€150-200 EUR

In progress
Posted: almost 4 years ago

Paid on delivery
First of all, apologies for the change of budget. The project description is now completely different: this is a modification/upgrade of an existing scraper, not the development of a new one :) Not a big project, but an interesting one for sure :)

The web scraper is already developed according to the instructions below, but it needs to be upgraded (GUI/UX and some functions). You will find the dev files and documentation in the attached zip file. You can also check the project out on GitHub ([login to view URL]). We need to upgrade it so that it can adapt to changes on all the websites it scrapes data from, and we also want to add this website: www (dot) startupblink (dot) com

Web scraping tool to gather data for a startups database. It needs to gather data from known sites (more info in the attached documents):

The web scraper should be capable of gathering basic lead info, such as the startup name and some contact information, and possibly the startup description and logo.

The web scraper should accept a URL parameter (where to scrape for data) and a depth level (how deep the scraper should dig, e.g. how many sub-links or sub-sections per URL, and whether the scraper should go outside the specified URL, e.g. follow external links).

At a later stage, the same web scraper should be configurable to search for leads other than startups, such as investment entities, service providers, etc.

Background and strategic fit

All scraped info should be saved into two databases: startups (max 3 years old) and other companies. There should be a simple way to convert the DBs into CSV files.

The web scraping tool should be configured in such a way that admins can insert a starting URL and define what they are looking for (for example startups, investment entities, service providers, etc.), as well as the list of data they want, such as company (startup) name, contact data, descriptions, and/or other properties.
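The storage and export requirement above (two databases plus a simple CSV conversion) could be sketched roughly as follows. This is only an illustration, not the existing scraper's code: it assumes SQLite, models the two databases as two tables, and uses hypothetical names (`open_store`, `save_lead`, `export_csv`, the `founded_year` column). It also assumes a lead is identified by its name, which the brief does not specify.

```python
import csv
import io
import sqlite3
from datetime import date

MAX_STARTUP_AGE_YEARS = 3  # from the brief: startups are at most 3 years old
TABLES = ("startups", "other_companies")  # illustrative table names


def open_store(path=":memory:"):
    """Create both tables if they do not exist yet and return the connection."""
    conn = sqlite3.connect(path)
    for table in TABLES:
        conn.execute(
            f"CREATE TABLE IF NOT EXISTS {table} ("
            "name TEXT PRIMARY KEY, contact TEXT, "
            "description TEXT, founded_year INTEGER)"
        )
    return conn


def save_lead(conn, lead, today=None):
    """Route a lead to the matching table; return True if a new row was stored."""
    today = today or date.today()
    year = lead.get("founded_year")
    # Assumption: a lead with no known founding year goes to other_companies.
    is_startup = year is not None and today.year - year <= MAX_STARTUP_AGE_YEARS
    table = "startups" if is_startup else "other_companies"
    cur = conn.execute(
        f"INSERT OR IGNORE INTO {table} "
        "(name, contact, description, founded_year) VALUES (?, ?, ?, ?)",
        (lead["name"], lead.get("contact"), lead.get("description"), year),
    )
    conn.commit()
    return cur.rowcount == 1  # 0 means the name was already present


def export_csv(conn, table):
    """Return the whole table as CSV text, header row first."""
    if table not in TABLES:
        raise ValueError(f"unknown table: {table}")
    cur = conn.execute(f"SELECT * FROM {table} ORDER BY name")
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(col[0] for col in cur.description)
    writer.writerows(cur)
    return out.getvalue()
```

A call such as `export_csv(conn, "startups")` returns text that can be written straight to a `.csv` file. Using a single file-based store keeps the tool deployable from a container without a separate database server.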
From a tech perspective, the tool should use an existing web scraper, regardless of its tech stack. There are some pretty cool Java-, Python- and Node-based web scrapers. It must be an easily deployable tool that does not require additional server resources or a specific infrastructure stack, which would create overhead. Basically, whatever can run in a container or similar environment could work for us, as long as it is not resource-hungry and does not cost a lot to operate.

When the scraping tool is started, it should find the required data at the specified URL, then check whether we already have that data in our databases, and if not, save it into our startup database.

Assumptions

1. Starting URL (MUST HAVE)
As an operator, I want to be able to input a starting point (URL) for web scraping.
Operator inputs the starting URL for scraping.

2. Search params (MUST HAVE)
As an operator, I want to be able to input the parameters I am looking for.
Operator inputs what type of data or properties they are looking for, such as startup name, startup contact data, startup description, startup logo. The params should be added dynamically because they will vary from URL to URL. Each search param should accept multiple selectors: on some websites the startup name is titled "startup name", while on others it is "company name" or just "name". We need to be able to define multiple param names and group them under a single title.

3. Depth level (MUST HAVE)
As an operator, I want to be able to input the depth level for my starting URL.
Operator can select the depth level for scraping from a dropdown with the values "1, 2, 3, 4, 5, any", defining how deep the scraper should dig from the starting URL.

4. Follow external links (MUST HAVE)
As an operator, I want to be able to choose whether my scraping tool should follow any external links from my starting URL.
Operator chooses Yes or No.

User interaction and design

The tool needs to have a very simple interface for operators, and it requires authorization before it can be used.
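The four operator inputs above (starting URL, search params with grouped names, depth level, external links) could be modelled along these lines. This is a minimal sketch under stated assumptions, not the existing tool's design: `ScrapeConfig`, `parse_depth`, `normalize_record` and `crawl` are hypothetical names, the start page is counted as depth 0, and page fetching is passed in as a `fetch_links` callable so the traversal logic can be tried without network access.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List, Optional
from urllib.parse import urljoin, urlparse


@dataclass
class ScrapeConfig:
    start_url: str
    max_depth: Optional[int] = 1   # None stands for the "any" dropdown value
    follow_external: bool = False  # the Yes/No choice
    # single title -> param names seen on different sites
    field_aliases: Dict[str, List[str]] = field(default_factory=dict)


def parse_depth(choice: str) -> Optional[int]:
    """Map a dropdown value ("1" to "5", or "any") to max_depth."""
    if choice == "any":
        return None
    depth = int(choice)
    if not 1 <= depth <= 5:
        raise ValueError(f"depth must be 1-5 or 'any', got {choice!r}")
    return depth


def normalize_record(raw: Dict[str, str], config: ScrapeConfig) -> Dict[str, str]:
    """Group site-specific param names under their single title."""
    lookup = {alias.lower(): title
              for title, aliases in config.field_aliases.items()
              for alias in aliases}
    record: Dict[str, str] = {}
    for label, value in raw.items():
        title = lookup.get(label.strip().lower())
        if title is not None and title not in record:
            record[title] = value  # unknown labels are dropped
    return record


def crawl(config: ScrapeConfig,
          fetch_links: Callable[[str], Iterable[str]]) -> List[str]:
    """Breadth-first walk from start_url; returns pages in visit order."""
    home = urlparse(config.start_url).netloc
    seen = {config.start_url}
    order: List[str] = []
    queue = deque([(config.start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        order.append(url)
        if config.max_depth is not None and depth >= config.max_depth:
            continue  # depth limit reached: do not expand this page
        for href in fetch_links(url):
            link = urljoin(url, href)  # resolve relative links
            if link in seen:
                continue
            if not config.follow_external and urlparse(link).netloc != home:
                continue  # external link and the operator chose "No"
            seen.add(link)
            queue.append((link, depth + 1))
    return order
```

With `field_aliases={"startup_name": ["startup name", "company name", "name"]}`, a page that labels the field "Company Name" and another that labels it "name" both end up under `startup_name`. Note that with depth "any" and external links enabled, the walk is bounded only by the set of visited URLs, so a real run would likely need an extra page limit.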
Project ID: 25625798

About the project

11 proposals
Remote project
Active: 4 years ago

11 freelancers are bidding an average of €344 EUR for this project
Hello! I have read your description and understand your idea. I have also checked your attachment files. Web scraping and automation scripts are my favorite skills. I have done many scraping and automation projects using Python Selenium, Beautiful Soup and pandas. I can satisfy all your requirements. I am sure I can offer a good result and fast delivery because I have good experience in this field. I can show you my previous work if you want. I don't bid on projects which I cannot do. Your job suits my skills. Let's get in touch to discuss the details. Best regards!
€500 EUR in 7 days
4.9 (18 reviews)
6.5
Hello. As a web scraping and data mining expert working with Python, Node.js and Selenium WebDriver, I am glad to place a bid on your project. I have experience with LinkedIn profile scraping, Amazon product scraping, ticket price tracking and so on. I would like to discuss more via chat. Regards, Vladimir
€400 EUR in 7 days
4.8 (24 reviews)
5.2
Hi there. I have read your description carefully. I am very interested in your web scraper updating project. I have rich experience with web scraping using Python and PHP. Looking forward to hearing from you. Best wishes.
€175 EUR in 3 days
5.0 (5 reviews)
3.6
I have been working as a software developer for more than 8 years. I understand that you are looking for an expert software developer who knows the work very well. I can fulfill your requirements properly, as I have experience working in this sector. I'll be able to complete your work on time and without mistakes. I have also checked your time schedule and I can assure you that timing won't hamper your work. I've gathered experience over the years, so I don't think you will regret considering me for this job. If you would like to know about my skills and experience, visit my profile and read the reviews given by previous clients. I hope that I'll be able to satisfy you too. Please also check my portfolio, which is 100% similar to your job posting. So give it a thought; I'm eagerly looking forward to working with you. Please call me for an interview if you would like to give me a chance. I am available on any kind of communication software to make this project successful. Thanks, Sunil BG
€600 EUR in 7 days
1.0 (1 review)
1.3
Hi sir, I'm a professional in this field and I can do this job efficiently. Why me? Strict confidentiality: I won't provide the client's data to anyone. You will get me: the work will not be outsourced to anyone. Unlimited revisions until acceptance. Quick response. I will complete my work within deadlines.
€250 EUR in 4 days
0.0 (0 reviews)
0.0
I am interested in this job.
€556 EUR in 25 days
0.0 (0 reviews)
0.0
I will send you a set of lecture slides and notes and you will need to summarize and make them into concise notes.
€194 EUR in 10 days
0.0 (0 reviews)
0.0
Hi there. I have read your description carefully. I am very interested in your web scraper updating project. I have rich experience with web scraping using Python. Looking forward to hearing from you. Best wishes.
€175 EUR in 7 days
0.0 (0 reviews)
0.0

About the client

Ljubljana, Slovenia
5.0
20
Payment method verified
Member since Nov 27, 2018
