Scraping web content with Selenium in Java

In progress. Posted 6 years ago. Paid on delivery.
I want to download the content of a website, but it is a single-page site (like Instagram), so I can't just download an HTML file. The site has to be accessed programmatically through a browser (preferably in a headless environment, such as a Linux machine running in AWS), using XPath to find the information we need and saving it somewhere (text files would be fine as long as the data format is consistent).
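
As a rough illustration of the "find it with XPath, then save one consistent line per record" idea, here is a minimal sketch. It uses the JDK's built-in `javax.xml.xpath` on a small well-formed snippet as a stand-in for the rendered page. The class name `XPathSnapshot`, its methods and the `posts`/`followers` class attributes are hypothetical and are not Instagram's real markup. Against a live page, the same kind of expression would instead be passed to Selenium's `By.xpath`, since real pages are rarely well-formed XML.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

// Hypothetical helper: not part of Selenium or of any existing project.
final class XPathSnapshot {

    /** Evaluates an XPath expression against well-formed markup and returns the trimmed text. */
    static String extract(String markup, String expression) {
        XPath xpath = XPathFactory.newInstance().newXPath();
        try {
            // Returns the text of the first matching node, or "" when nothing matches.
            return xpath.evaluate(expression, new InputSource(new StringReader(markup))).trim();
        } catch (XPathExpressionException e) {
            throw new IllegalArgumentException("Could not evaluate " + expression, e);
        }
    }

    /** Joins fields with tabs so every saved line has the same column layout. */
    static String toLine(String... fields) {
        StringBuilder line = new StringBuilder();
        for (int i = 0; i < fields.length; i++) {
            if (i > 0) {
                line.append('\t');
            }
            // Tabs and line breaks inside a value would break the layout, so flatten them.
            line.append(fields[i].replaceAll("[\\t\\r\\n]+", " "));
        }
        return line.toString();
    }
}
```

Each record then becomes one tab-separated line, which keeps the text files easy to compare between snapshots.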

Assuming we continue with our Instagram example, the objective is to take a snapshot of all the information in a public account. This can be split into several steps:

* Start on the main page of an Instagram user and get the number of posts, followers and following, plus the bio/description.

* Get the full list of followers (Instagram IDs). Note that this requires opening a sub-window and scrolling down.

* Get the full list of following (Instagram IDs). Note that this requires opening a sub-window and scrolling down.

* Then, for each post, get a link to the post, a link to the image (and maybe a thumbnail), the number of likes, the number of comments, the list of hashtags, the list of people who commented, the list of people mentioned and possibly the text of the post plus all the comments. Since this has to be done for all posts, we will need to scroll down the page all the way to the end.
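
The "scroll down until nothing new appears" part shared by the follower, following and post steps could be sketched roughly as below. `ScrollableList` is a hypothetical interface, not part of Selenium. In a real run it would presumably be backed by WebDriver calls (for example a script executed through `JavascriptExecutor`), and the idle and total scroll limits are assumed safety values.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical abstraction over the browser, so the loop can run without one. */
interface ScrollableList {
    /** Returns the ids currently rendered in the list. */
    List<String> visibleIds();

    /** Scrolls down one step and waits for any new content to load. */
    void scrollDown();
}

final class ScrollCollector {

    /**
     * Scrolls until no new ids have appeared for maxIdleScrolls scrolls in a row,
     * or until maxScrolls scrolls have been made, whichever comes first.
     */
    static Set<String> collectAll(ScrollableList list, int maxIdleScrolls, int maxScrolls) {
        // LinkedHashSet removes duplicates while keeping the order in which ids appeared.
        Set<String> seen = new LinkedHashSet<>(list.visibleIds());
        int idle = 0;
        int scrolls = 0;
        while (idle < maxIdleScrolls && scrolls < maxScrolls) {
            list.scrollDown();
            scrolls++;
            // addAll returns true only if at least one id was not seen before.
            boolean grew = seen.addAll(list.visibleIds());
            idle = grew ? 0 : idle + 1;
        }
        return seen;
    }
}
```

Keeping the loop behind an interface means it can be unit tested with a fake list, without starting a browser.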

This is a side project for me and I will have to maintain the code, so very simple, easy-to-read source code is preferred, with plenty of comments explaining why things are done. Please take that into account when making an offer. Unit tests would be a very welcome addition (starting from the very obvious cases and moving on to more complicated ones).
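
One way to keep the code easy to test is to put text handling in small pure functions that do not touch the browser. The sketch below extracts hashtags and mentions from a caption. `CaptionParser` and both patterns are assumptions made for illustration and are not Instagram's actual rules for valid tags or usernames.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: pure string handling, so it needs no browser to test.
final class CaptionParser {

    // Assumption: a hashtag is '#' followed by ASCII letters, digits or underscores.
    private static final Pattern HASHTAG = Pattern.compile("(?<!\\w)#(\\w+)");

    // Assumption: a mention is '@' followed by word characters, with single dots allowed inside.
    // The lookbehind skips e-mail addresses such as a@b.com.
    private static final Pattern MENTION = Pattern.compile("(?<!\\w)@(\\w+(?:\\.\\w+)*)");

    /** Returns the distinct hashtags in the text, without '#', in order of first appearance. */
    static List<String> hashtags(String text) {
        return findAll(HASHTAG, text);
    }

    /** Returns the distinct mentions in the text, without '@', in order of first appearance. */
    static List<String> mentions(String text) {
        return findAll(MENTION, text);
    }

    private static List<String> findAll(Pattern pattern, String text) {
        List<String> found = new ArrayList<>();
        if (text == null) {
            return found; // A missing caption is treated like an empty one.
        }
        Matcher matcher = pattern.matcher(text);
        while (matcher.find()) {
            String value = matcher.group(1);
            if (!found.contains(value)) {
                found.add(value);
            }
        }
        return found;
    }
}
```

With this shape, unit tests can start from the obvious cases (an empty caption returns an empty list, `"Sunset #beach"` returns `[beach]`) and move on to trickier ones (`"#a #b #a"` returns `[a, b]`, and `"mail a@b.com"` yields no mentions).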

A payment schedule will be established for each one of the steps, starting from the most basic of just getting the number of posts, number of followers, number of following and bio/description.

Please only apply if you are very familiar with this kind of work. I am a software engineer myself and, although I am not familiar with the details of web automation, I do a very decent job of reading Java code.

Java JUnit Selenium Webdriver Web Scraping xpath

Project ID: #16630199

About the project

18 proposals. Remote project. Active 6 years ago.

18 freelancers are bidding an average of €188 for this project

seaanddream

Hi, my name is Sevinc. My bid covers all the phases you explained above... I read your "Scraping web content with Selenium in Java" project description carefully before bidding. I checked your requirements as well...

€500 EUR in 7 days
(258 reviews)
7.7
zekovicm

Hi there, I am Miljan, a web scraping expert from Bosnia & Herzegovina, Europe. I have carefully gone through your requirements and I would like to help you with this job! I can start immediately and finish it within ...

€222 EUR in 3 days
(79 reviews)
7.0
phpXpertbd

Dear Sir, I'm delighted to let you know that I have done data scraping with PHP-cURL, PhantomJS, Node.js and Selenium on many sites. I just scraped the data from the website and then wrote the data to a MySQL database ...

€100 EUR in 3 days
(66 reviews)
7.2
abedin94

Hi, I have scraped 100+ websites; Amazon, Adidas and many more are among them. Message me so that we can discuss. I can do this using Python, not Java; if the language is not a problem for you, then message me.

€166 EUR in 3 days
(116 reviews)
6.4
schoudhary1553

Hi there, warm greetings. We came across your request for "Scraping web content with Selenium in Java" and we reviewed your project description. We'd like to help you with confidence and satisfying results ...

€300 EUR in 5 days
(26 reviews)
6.0
olegivanoviam

Hi. My name is Oleg and I am a senior C#/.NET software developer from Ukraine. Please contact me to discuss the details of the project.

€250 EUR in 3 days
(3 reviews)
4.5
fertroya

Hi, I'm interested in this job, also as a side job. In my current job we have Selenium with the Chrome web driver already set up in a headless environment running in EC2. We use this environment to run our end-to-end tests. ...

€166 EUR in 3 days
(15 reviews)
4.4
DarkKnight2206

I am a Python developer. I have extensive experience in web scraping and I am an expert in it. I have all the necessary skills to scrape almost any website. I have even scraped sites like Google, WhatsApp Web and Instagram, which ...

€125 EUR in 2 days
(14 reviews)
4.8
sonaligupta334

A proposal has not yet been provided

€155 EUR in 3 days
(2 reviews)
2.2
WebStar426

I am an experienced Selenium expert. Hi, I’ve carefully gone through your job posting. I am a web scraping expert.... I will get all the data you want. I can even scrape the eBay site. I can work with you anytime you want. ...

€222 EUR in 3 days
(2 reviews)
1.6
VirtualBrainInc

Hello, I have briefly read the description of the "Scraping web content with Selenium in Java" project, and I can deliver as per the requirements; however, I need us to discuss it for more clarity on the details, deadline ...

€155 EUR in 3 days
(2 reviews)
1.2
sergsyuzev

Hello. I have spent a lot of time with Python + Selenium and I can do this for you. I need more details about your task. Best regards, Serge

€166 EUR in 5 days
(0 reviews)
0.0
pradeepta02

I have more than 10 years of experience in automation testing using Selenium/WebDriver, Java, Python, Appium, etc. My specialities include building automation frameworks from scratch, writing automated test scripts and ...

€155 EUR in 3 days
(0 reviews)
0.0