Crawl iTunes API and Insert into ElasticSearch

Closed. Posted 4 years ago. Paid on delivery.

I currently have a script that queries the iTunes API and stores the data in Elasticsearch and Cassandra databases. It crawls the podcast RSS feeds twice per day and also checks iTunes for new podcasts every day. Here is an example of an RSS feed that it parses:

[login to view URL]

Podcasts are like audio shows, and each podcast has multiple episodes. In other words, each podcast has one RSS feed, and each RSS feed lists the episodes for that podcast sorted by newest release date first.
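To make the feed structure concrete, here is a rough sketch of reading one podcast feed with Python's standard library. It is not the existing script: the sample feed, the `parse_episodes` name, and the choice to keep episodes with unreadable dates (sorted last) instead of failing are all assumptions for illustration.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Hypothetical feed used only for this example; real feeds carry many more fields.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Show</title>
    <item><title>Episode 2</title><pubDate>Tue, 08 Oct 2019 09:00:00 +0000</pubDate></item>
    <item><title>Episode 1</title><pubDate>Tue, 01 Oct 2019 09:00:00 +0000</pubDate></item>
    <item><title>Undated bonus</title><pubDate>not a date</pubDate></item>
  </channel>
</rss>"""


def parse_episodes(feed_xml):
    """Return (podcast_title, episodes) with episodes sorted newest first."""
    channel = ET.fromstring(feed_xml).find("channel")
    if channel is None:
        raise ValueError("feed has no <channel> element")

    episodes = []
    for item in channel.findall("item"):
        raw_date = item.findtext("pubDate", default="")
        try:
            # RSS dates use the RFC 822 style that email.utils understands.
            published = parsedate_to_datetime(raw_date)
        except (TypeError, ValueError):
            # Assumption: keep the episode and mark the date as unknown.
            published = None
        episodes.append(
            {"title": item.findtext("title", default=""), "published": published}
        )

    # Dated episodes first (newest to oldest), undated episodes at the end.
    episodes.sort(
        key=lambda e: (
            e["published"] is not None,
            e["published"].timestamp() if e["published"] else 0.0,
        ),
        reverse=True,
    )
    return channel.findtext("title", default=""), episodes


title, episodes = parse_episodes(SAMPLE_FEED)
for episode in episodes:
    print(title, "|", episode["title"], "|", episode["published"])
```

Catching the date error per episode means one malformed `pubDate` does not discard the rest of the feed.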

The current developer of the script is not very responsive to making changes, so your job is to:

1 - Fix the parse errors that occur for some of the podcast RSS feeds.

2 - Add the podcasts we are missing. We are missing a lot of podcasts from iTunes, and we can get some of them from another website's API.

3 - Set up data for each podcast on how often it releases new episodes. We can determine the release frequency just by looking at the RSS feed, and then store it in the database. For example, podcasts that release once per day or multiple times per day should be crawled every hour. Those that release once per week should be crawled maybe 4 times per day, and so on.
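As a rough illustration of item 3, the sketch below estimates a podcast's release frequency from its episode dates and maps it to a crawl interval. The hourly and 4-times-per-day tiers come from the description above. The function names, the use of the median gap, and the 12-hour fallback for slower or unknown podcasts (which matches the current twice-daily crawl) are assumptions, not the existing code.

```python
from datetime import datetime, timedelta
from statistics import median


def median_release_gap(published):
    """Median time between consecutive episodes, or None with fewer than two dates."""
    ordered = sorted(published)
    gaps = [
        (later - earlier).total_seconds()
        for earlier, later in zip(ordered, ordered[1:])
    ]
    return timedelta(seconds=median(gaps)) if gaps else None


def crawl_interval(gap):
    """Map a release gap to how often the feed should be crawled."""
    if gap is None:
        # Assumption: unknown frequency keeps the current twice-daily crawl.
        return timedelta(hours=12)
    if gap <= timedelta(days=1):
        # Daily or several times per day: crawl every hour.
        return timedelta(hours=1)
    if gap <= timedelta(weeks=1):
        # Up to weekly: crawl about 4 times per day.
        return timedelta(hours=6)
    # Assumption: slower than weekly also stays on the twice-daily crawl.
    return timedelta(hours=12)


daily = [datetime(2019, 10, day) for day in range(1, 8)]
weekly = [datetime(2019, 10, 1) + timedelta(weeks=i) for i in range(5)]

print(crawl_interval(median_release_gap(daily)))
print(crawl_interval(median_release_gap(weekly)))
```

The median is used here so that a single long break between episodes does not change a podcast's tier. The computed gap is the value that could be stored with each podcast record.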

I will give you the code so you can understand it, and you can also talk with one of my other engineers who knows how it works.

The code is written in Python. You must also show me expertise in Elasticsearch.

Thank you

Python, Node.js, Elasticsearch, Cassandra, Podcasting

Project ID: #20952504

About the project

9 proposals. Remote project. Active 4 years ago.

9 freelancers are bidding on average $367 for this project

zekovicm

Hi there, I am a web scraping expert from Bosnia & Herzegovina, Europe. I have carefully gone through your requirements and I would like to help you with this project! I can start immediately and finish it within the ...

$322 USD in 7 days
(29 reviews)
6.1
liveexperts123

Hi there, I have read your project description and I'm confident I can do this project for you perfectly. I still have a few questions. Please leave a message on my chat so we can discuss the budget and deadline of the ...

$400 USD in 3 days
(16 reviews)
5.9
whiteeagle0001

Hello, how are you? I have read your description in detail and am very interested in your project. I think that I can finish your work perfectly as you need. I have a lot of experience with Node.js. If you need to ...

$400 USD in 7 days
(20 reviews)
4.6
zeke

I have lots of experience writing web automation scripts using Scrapy, and with Elasticsearch too. Available to start immediately and finish as soon as possible. Please contact me to discuss details if you are interested. ...

$250 USD in 7 days
(13 reviews)
4.9
umairkaramat24

Hello there. How are you doing? I have read the description, and I have great experience doing similar jobs related to these skills: Cassandra, Elasticsearch, Node.js, Podcasting, Python. Please start the chat so we can hav...

$280 USD in 13 days
(6 reviews)
3.3
sharktiger

Good day! I'm a licensed full stack developer and designer. I have a lot of experience in Python/Django, Python Selenium web scraping, and Python image processing using the OpenCV package. I have many ...

$250 USD in 7 days
(2 reviews)
3.2
jaymaninfotech2

JAYMAN INFOTECH PVT LTD is a contemporary website design and development company with a focus on user-centered design while helping our clients achieve the desired result. We are a custom software development company ...

$850 USD in 45 days
(0 reviews)
0.0