Find Jobs
Hire Freelancers

For PaulWalton

$100-300 USD

Abgesagt
Veröffentlicht vor etwa 18 Jahren

$100-300 USD

Bezahlt bei Lieferung
1. I must be able to set the starting URL from which the spider will intitiate from on the [login to view URL] or [login to view URL] websites. The [login to view URL] website is powered by [login to view URL] and is in the same format but I find the navigation to find various categories easier with 411.com. For example I might paste the following URL into the spider utility: [login to view URL];C=jewelers&R=N&STYPE=S&MC=1&OO=1&F=1&CP=Clothing+%26+Accessories%5EJewelry%5EJewelers%5E 2. Once the starting URL has been entered, the spider must parse the HTML and extract the business name, city, state, zipcode, telephone number, fax number (if applicable), email address (if applicable), and website (if applicable) into a CSV formatted text file. 3. Spider must crawl through each of the pages until the final page for that category is completed. However, at the very beginning of most categories, there are businesses listed under the "Yellow Pages - Advertisers" heading. These are businesses that are not from the area that I have chosen (for example I chose Alaska and they are from California, etc.) but are advertising in that area. I do not want these entries included. I would want the ones that start under the "Yellow Pages - Listings" heading. The spider does not neccessarily need to know how my list was created, only to avoid entries under the "Advertisers" section. 4. When completed, an update function that lets me name a new file to save the data to or lets me choose an exisiting .CSV file to append the new data to. 5. Search and purge function that can be run anytime on any of the .CSV files that have been created to ensure no two entires have the same telephone number in a specific .CSV file. If duplicates telephone numbers are found, records with the least information are automatically deleted. For example, 2 records wit the same telephone numbers but one lists a fax and the other doesn't, then delete the one without the fax number. 6. Merge function that can be run any time and lets me pick 2 or more .CSV created files and merge them into one new file. If more than 2 files is a problem, I can live with 2 and merge a few times to create one file. 7. Finally, I will provide you with 2 URL's which will represent 2 different yellow page categories on the [login to view URL] website and you will run the completed program and email me (or make available to download), 2 .CSV files with the completed and duplicate purged files. My Requirements: 1. You will be easily contacted. Either by phone, or you will be required to answer any e-mail I send to you within 10 hours time. 2. Must speak and write english well. 3. Code must be well commented in english. 4. All source code must be given to me. 5. I would prefer if this was written in a common programming language. 6. I would like this done by no later than March 10th, 2006. 7. Must be able to run on my Pentium III with Windows XP. I am in a very rural area and only have dial up. 8. Delivery of files will be via email for sure and possibly by FTP. 8. A $50 bonus (subject to any fees from [login to view URL]) above the agreed bid price if the entire project is completed and delivered to me by March 6th, 2006.
Projekt-ID: 46699

Über das Projekt

Remote Projekt
Aktiv vor 18 Jahren

Möchten Sie etwas Geld verdienen?

Vorteile einer Ausschreibung auf Freelancer

Legen Sie Ihr Budget und Ihren Zeitrahmen fest
Für Ihre Arbeit bezahlt werden
Skizzieren Sie Ihren Vorschlag
Sie können sich kostenlos anmelden und auf Aufträge bieten

Über den Kunden

Flagge von CANADA
Charlottetown, Canada
5,0
11
Zahlungsmethode verifiziert
Mitglied seit Feb. 27, 2006

Kundenüberprüfung

Danke! Wir haben Ihnen per E-Mail einen Link geschickt, über den Sie Ihr kostenloses Guthaben anfordern können.
Beim Senden Ihrer E-Mail ist ein Fehler aufgetreten. Bitte versuchen Sie es erneut.
Registrierte Benutzer Veröffentlichte Jobs
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Vorschau wird geladen
Erlaubnis zur Geolokalisierung erteilt.
Ihre Anmeldesitzung ist abgelaufen und Sie wurden abgemeldet. Bitte melden Sie sich erneut an.