Crawl Domain Find Expired Domains -- 2

Closed · Posted 4 years ago · Paid on delivery

We are looking for a crawler to crawl every page of a website looking for external links pointing to expired domains.

The user should define a list of sites to crawl via a text file. The crawler should work logically, following links to reach all pages of a site rather than depending on a sitemap. Only unique external domains should be logged, to prevent duplicate domain availability lookups.
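A minimal sketch of the crawl described above, using only the Python standard library. It follows internal links breadth-first and collects external hostnames in a set, so each one is recorded once. The names `LinkParser`, `crawl_site`, `fetch`, and `max_pages` are assumptions made for illustration and are not part of the brief. `fetch` is a caller-supplied function that returns a page's HTML (or `None` on failure), which keeps the HTTP client out of the sketch. Hostnames are compared as-is, so `www.example.com` and `example.com` count as different hosts; reducing hostnames to registrable domains would need a public suffix list and is not attempted here.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl_site(start_url, fetch, max_pages=1000):
    """Breadth-first crawl of one site by following its internal links.

    `fetch` is a hypothetical callable: URL in, HTML text (or None) out.
    Returns the set of unique external hostnames found on the site.
    """
    site_host = (urlparse(start_url).hostname or "").lower()
    queue = deque([start_url])
    seen_pages = {start_url}
    external_hosts = set()
    fetched = 0

    while queue and fetched < max_pages:
        page_url = queue.popleft()
        html = fetch(page_url)
        fetched += 1
        if not html:
            continue
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop any #fragment.
            absolute = urljoin(page_url, href).split("#", 1)[0]
            parts = urlparse(absolute)
            if parts.scheme not in ("http", "https") or not parts.hostname:
                continue  # skips mailto:, javascript:, and similar links
            host = parts.hostname.lower()
            if host == site_host:
                # Internal link: queue it once so every page gets visited.
                if absolute not in seen_pages:
                    seen_pages.add(absolute)
                    queue.append(absolute)
            else:
                # External link: the set keeps each hostname only once.
                external_hosts.add(host)
    return external_hosts
```

In practice `fetch` could wrap `urllib.request.urlopen` or a third-party HTTP client, and `crawl_site` would be called once per line of the user's site list.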

The user should also be able to define a list of URLs to skip when checking availability, e.g. [login to view URL]. These domains should be user-defined in a blacklist text file.
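One way the blacklist file could be handled is sketched below. It assumes one domain per line, with blank lines and lines starting with `#` ignored; the brief does not specify the file format, so that layout is an assumption, as are the function names `load_domain_list`, `is_blacklisted`, and `filter_candidates`. Matching a blacklisted domain also excludes its subdomains, which is a design choice rather than a stated requirement.

```python
from pathlib import Path


def load_domain_list(path):
    """Read one domain per line; skip blank lines and '#' comments."""
    domains = set()
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        entry = line.strip().lower()
        if entry and not entry.startswith("#"):
            domains.add(entry)
    return domains


def is_blacklisted(host, blacklist):
    """True if host equals a blacklisted domain or is a subdomain of one."""
    host = host.lower().rstrip(".")
    return any(host == d or host.endswith("." + d) for d in blacklist)


def filter_candidates(hosts, blacklist):
    """Return the hosts that still need an availability lookup, sorted."""
    return sorted(h for h in hosts if not is_blacklisted(h, blacklist))
```

The same `load_domain_list` helper could also read the list of sites to crawl, since both inputs are plain text files.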

Results should be delivered as a CSV file listing the linking domain and the available domain.
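The output step might look like the following sketch, which uses the standard `csv` module. The column names `linking_domain` and `available_domain` and the function name `write_results` are assumptions; the brief only says the file should list both domains.

```python
import csv


def write_results(rows, path):
    """Write (linking_domain, available_domain) pairs to a CSV file."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["linking_domain", "available_domain"])
        # De-duplicate and sort so repeated runs give stable output.
        for linking, available in sorted(set(rows)):
            writer.writerow([linking, available])
```

Opening the file with `newline=""` lets the `csv` module control line endings itself, which avoids blank rows on Windows.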

Python Web Scraping

Project ID: #19765484

About the project

6 proposals · Remote project · Active 4 years ago

6 freelancers are bidding an average of $28 for this job

chirgeo

Hi. I did read the project description and have a few questions. 1. Do you need the script as well or data only? 2. What is the format of the output data? CSV is OK? We can do other formats as well. 3. Which fields do…

$100 USD in 5 days
(110 reviews)
7.3
smsaurabhv

Hi, I have gone through your requirement to scrape lots of websites. I am an EXPERT in building scraping tools/scripts. Hence, I can SURELY work on your project. I have 4 YEARS of EXPERIENCE in developing PHP-PYTHO…

$15 USD in 3 days
(51 reviews)
4.9
techlinesols6

Dear Prospective Hiring Manager, thank you for giving me a chance to bid on your project. I am a serious bidder here, I have already worked on a similar project before, and I can deliver as you have mentioned. "I can do th…

$13 USD in 7 days
(1 review)
0.0
hienhdt32

I have experience crawling data with Python using bs4, Scrapy, ... and extracting data to XML, JSON, ... Contact me!

$20 USD in 2 days
(0 reviews)
0.0
junadatar947

Hi there, JUNA here. I understand that you need a crawler or A SPIDER for scraping expired domains, but my question is: will you provide the list of domains that you need to check for availability? Otherwise this sho…

$12 USD in 3 days
(0 reviews)
0.0