Parsing a 5GB dump of clean data (estimated 1 day of work)

1. I have a 5GB dump of crawled data that needs to be parsed.

2. It's contact data: name, workplace, phone number, fax, etc. There are 250k contacts in total.

3. The data is very clean, all from ONE source, but it is pretty complex.

4. Each contact has many attributes, such as 1 workplace, n skills, and n references to other contacts in the database (colleagues, co-workers in the same workplace, co-workers in the same department, etc.).
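
To make the shape described in point 4 concrete, here is a minimal Python sketch of one contact record. All field names (`contact_id`, `colleague_ids`, and so on) are assumptions for illustration only; the actual dump may use different keys.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Contact:
    """One parsed contact. Field names are hypothetical, not taken from the dump."""
    contact_id: str
    name: str
    workplace: Optional[str] = None                          # one workplace per contact
    phone: Optional[str] = None
    fax: Optional[str] = None
    skills: List[str] = field(default_factory=list)          # n skills
    colleague_ids: List[str] = field(default_factory=list)   # n references to other contacts


def unresolved_references(contacts: List[Contact]) -> List[str]:
    """Return referenced ids that do not match any contact in the list."""
    known = {c.contact_id for c in contacts}
    missing = {ref for c in contacts for ref in c.colleague_ids if ref not in known}
    return sorted(missing)
```

Storing references as ids rather than nested objects keeps each record small, and a check like `unresolved_references` could flag contacts that point at entries missing from the parsed set.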

I need this data parsed. It is currently dumped into MongoDB and we have already written a shell script parsing much of the contact information, but not all. If you want you can continue with this script, or you can start over and create your own.
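
As a rough illustration of what the parsed output could look like, the sketch below streams documents one at a time and writes three flat CSV tables (contacts, skills, references). It assumes each document is already a dictionary with the hypothetical keys `_id`, `name`, `workplace`, `phone`, `fax`, `skills`, and `references`; the real dump may store raw crawled content that needs extraction first.

```python
import csv
from typing import Dict, Iterable, TextIO, Tuple


def flatten(doc: Dict) -> Tuple[list, list, list]:
    """Split one document into a contact row, skill rows, and reference rows.

    All keys used here are assumptions; adjust them to the real dump.
    """
    cid = str(doc["_id"])
    contact = [cid, doc.get("name", ""), doc.get("workplace", ""),
               doc.get("phone", ""), doc.get("fax", "")]
    skills = [[cid, s] for s in doc.get("skills", [])]
    refs = [[cid, str(r["id"]), r.get("relation", "")]
            for r in doc.get("references", [])]
    return contact, skills, refs


def export(docs: Iterable[Dict], contacts_out: TextIO,
           skills_out: TextIO, refs_out: TextIO) -> int:
    """Write documents to three CSV streams one at a time; returns the count.

    Only one document is held in memory at once, so the size of the
    dump does not matter.
    """
    cw = csv.writer(contacts_out)
    sw = csv.writer(skills_out)
    rw = csv.writer(refs_out)
    cw.writerow(["contact_id", "name", "workplace", "phone", "fax"])
    sw.writerow(["contact_id", "skill"])
    rw.writerow(["contact_id", "other_contact_id", "relation"])
    count = 0
    for doc in docs:
        contact, skills, refs = flatten(doc)
        cw.writerow(contact)
        sw.writerows(skills)
        rw.writerows(refs)
        count += 1
    return count
```

With the `pymongo` driver, `docs` could be the cursor returned by a collection's `find()` call, since that cursor is iterable; the database and collection names would depend on how the dump was loaded.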

This should not take longer than a day of work if you know what you are doing.

This is a fair estimate from the developer who crawled the data and started parsing it, but who no longer has the time to complete the job.

There is no rush, so you can work on this over the next 3 weeks.

Skills: NoSQL Couch & Mongo, Perl, PHP, Shell Script, SQL

About the employer:
(22 reviews) Berlin, Germany

Project ID: #9141316

41 freelancers are bidding an average of $233 for this job


I am an expert in delivering customized scripts and look forward to discussing the project needs further.

$315 USD in 3 days
(151 reviews)

I'm an expert in data parsing and processing with many years of experience, which is why I'm sure you'll be impressed with my work. I can process all of your information fast and can offer you the best price here. Please show ...

$166 USD in 2 days
(451 reviews)

Hi, please attach sample crawled data and list all the fields to parse out. I can write a Perl script to extract the crawled data into a MySQL database or CSV file.

$388 USD in 14 days
(62 reviews)

Hello, I can do it with a Perl script, but I need a sample of the data to review the structure and test the parser. Regards, Dmitry

$140 USD in 3 days
(150 reviews)

Hello, very interested. I have extensive experience parsing large CSV and XML data using Perl and regular expressions. Most recently I finished importing a huge CSV file of movie data into a MySQL database. Please provide your du ...

$250 USD in 7 days
(131 reviews)

Do you have a sample of the input data? And what output do you need?

$444 USD in 21 days
(71 reviews)

Hello Sir. I have read your requirements in full. Yes, I have done this type of work before. Have a look at my website: I'm parsing data from target pages, getting titles, articles, images, and excerpts. Please che ...

$200 USD in 7 days
(59 reviews)

Hi, 1. Is there any chance of getting more information about the dumped data? Is it HTML pages or something else? 2. If the data is HTML, can you provide one page as an example, along with an estimate of its complexity? 3. Should extracted data be st ...

$133 USD in 2 days
(100 reviews)

Hey, I have a couple of questions. Can we talk?

$444 USD in 3 days
(16 reviews)

We have a team of UNIX experts with more than 10 years of rich industry experience and would be more than happy to work on the project. We have written our own crawler in Java and would be the best fit for the projec ...

$309 USD in 3 days
(11 reviews)

I'm a computer science professional with a PhD degree and I have extensive experience in databases, Perl, PHP, and several other programming languages. Please see reviews on my profile. It would be my pleasure to do th ...

$120 USD in 5 days
(22 reviews)

Hi, I have more than 8 years of experience in web scraping with Python, CasperJS, PhantomJS, Go, Perl, and PHP. In Perl I have implemented a channel manager and completed more than 600 hotel sites for updating their ...

$368 USD in 3 days
(6 reviews)

A proposal has not yet been provided

$110 USD in 10 days
(38 reviews)

Hello. I have more than 20 years of programming experience. I would suggest Perl for parsing or, if productivity is important, pure C, but I need more details to set a real price and time. Regards. ...

$100 USD in 5 days
(37 reviews)

A proposal has not yet been provided

$260 USD in 21 days
(27 reviews)

I am interested and would love to work on your project. I read through the job details extremely carefully and I am absolutely sure that I can do the project very well. I am a Web Develo ... with 4+ years of experience.

$444 USD in 3 days
(42 reviews)

Hi, if you can export your data to a CSV or text file, it will be easier to parse the data correctly. I will download the file and parse the data using Perl scripts. Best regards, Ilirjan

$166 USD in 5 days
(16 reviews)

I am experienced in creating SQL queries, dynamic SQL queries, SSIS packages, stored procedures, and SSRS reports.

$222 USD in 3 days
(21 reviews)

Hi, based on my 5 years of experience as a software engineer knowledgeable in Unix and Linux administration and expert in command-line applications, I can handle this task pretty well. Let me know the best time for you so w ...

$100 USD in 0 days
(12 reviews)

Hi, I have worked with many big files in low memory. I am also experienced in map-reduce in MongoDB and CouchDB, so this will be an easy task for me. I am ready to start work now. You need to pay only if you 100% sati ...

$100 USD in 3 days
(11 reviews)