ASP function to extract text content from a single page
$30-50 USD
Finalizat
Data postării: circa 13 ani în urmă
$30-50 USD
Plata la predare
I need a function to extract text from a single page. The function will be given an URL as a parameter, and must return the text in the page.
The URLs are collected from Google News, so all pages contain articles. The function will have to detect where is the article (by checking big portions of text without tags, for example), remove the tags from the text and return the title of the article and the plain text of the article.
In some cases, the URL takes to a page where there's another link and no more information. The system must detect this and follow this second link to get to the article.