Build data extract and transformation tool -- 3

$250-750 USD

Cancelled
Posted more than 8 years ago

Paid on delivery
We need a data extraction (E) + transformation (T) tool -- the importer -- that converts information stored in various file types (.csv, .xls, .xlsx, .mdb, .accdb, etc.) and structures (wide, long, dyadic, etc.) into a standardized .json format. The format is based on an already established data schema, and the tool is to be built off an existing prototype. Information on the JSON data schema can be found here: [login to view URL]

The data extraction (E) logic determines how to generate the flow of events from the data within each source file; the data transformation (T) logic determines how to serialize the data in the flow into separate JSON objects. The JSON output of the E+T tool will eventually be uploaded into a cloud-based data aggregation tool, DataGator, that is currently under development. The full-fledged importer needs to be robust enough to identify different variants of standard data fields and records, including non-standard characters within the data as well as within the file names, and to reformat and package these into the final .json format that DataGator recognizes.

We are looking for developers with expertise in Java and data processing. Familiarity with Jackson and Apache POI would also be advantageous. This project may become a longer-term arrangement for the development of additional auxiliary components for the DataGator project. The final iteration of this tool will be included in a public rollout of the DataGator system.

To help candidate developers understand our requirements, we have created a prototype of the importer tool. The prototype includes a working example that converts a single .csv file to JSON output. It can be found in our public GitHub repository, [login to view URL]

The work to be completed is to extend this prototype both horizontally (i.e. extract data from other file types such as .xls, .xlsx, .mdb, .txt) and vertically (i.e. support data filtering, merging, and hierarchical extraction, e.g. selecting multiple worksheets within a single Excel file, one sheet in multiple Excel files, multiple sheets within multiple Excel files, etc.). Multiple modules need to be constructed, based on file types and additional functionality.

NOTE: This project is only for Module 3 (XLSX Extractor). All additional modules will be completed under follow-on projects dependent on successful completion of Module #3.

Module 1: Build E+T functionality for .xlsx file
Module 2: Build E+T functionality for .xls file
Module 3: Build E+T functionality for .mdb file
Module 4: Build E+T functionality for .txt file
Module 5: Allow for processing of multiple files
Module 6: Create a Java GUI for client-side operation

Note: The logic for multi-sheet .xls / .xlsx / .mdb files can follow one of three alternative options:
(i) extract data from one of the sheets
(ii) extract data from multiple sheets and combine them into a single JSON object
(iii) extract data from multiple sheets and output them as individual JSON objects

Our expectation is that Module 3 (XLSX Extractor) can be completed within 3-4 days of the project being awarded, and that the freelancer is qualified to complete Modules 4 through 6.

We have provided several sample files that will need to be converted to JSON format and eventually uploaded. However, these sample files are not an exhaustive list of the characters or file structures the tool may encounter. The ideal freelancer will be willing and able to identify potential issues that may arise as other users begin to use the tool and load different file structures into it.

We are looking for freelancers who are passionate about this project and interested in a longer-term working relationship. Please reach out with any questions that you may have.
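The three multi-sheet options described in the note can be illustrated with a small Java sketch. Everything here is an assumption made for illustration: the class name `SheetExport`, the `Mode` enum, and the representation of a sheet as a list of column-name to value maps are hypothetical and do not come from the prototype. Rows are assumed to be already extracted (a real extractor would likely read them with Apache POI), the hand-written serializer stands in for a library such as Jackson, and the flat row objects do not follow the actual DataGator schema, which is not reproduced in this posting.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

/** Hypothetical sketch: each sheet is a list of rows, each row a column-name to value map. */
final class SheetExport {
    /** SINGLE = option (i), MERGED = option (ii), SEPARATE = option (iii). */
    enum Mode { SINGLE, MERGED, SEPARATE }

    /** Minimal JSON string escaping: quotes, backslashes, and control characters. */
    static String quote(String s) {
        StringBuilder sb = new StringBuilder("\"");
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"': sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.append('"').toString();
    }

    /** Serializes rows as a JSON array of flat objects (assumed shape, not the real schema). */
    static String rowsToJson(List<Map<String, String>> rows) {
        return rows.stream()
            .map(row -> row.entrySet().stream()
                .map(e -> quote(e.getKey()) + ":" + quote(e.getValue()))
                .collect(Collectors.joining(",", "{", "}")))
            .collect(Collectors.joining(",", "[", "]"));
    }

    /** Returns one JSON document per output object, depending on the chosen mode. */
    static List<String> export(Map<String, List<Map<String, String>>> sheets,
                               Mode mode, String selected) {
        List<String> out = new ArrayList<>();
        switch (mode) {
            case SINGLE: // only the sheet named by `selected`
                if (!sheets.containsKey(selected)) {
                    throw new IllegalArgumentException("No such sheet: " + selected);
                }
                out.add(rowsToJson(sheets.get(selected)));
                break;
            case MERGED: // rows of all sheets concatenated into one document
                List<Map<String, String>> all = new ArrayList<>();
                sheets.values().forEach(all::addAll);
                out.add(rowsToJson(all));
                break;
            case SEPARATE: // one document per sheet, in sheet order
                sheets.values().forEach(rows -> out.add(rowsToJson(rows)));
                break;
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, List<Map<String, String>>> sheets = new LinkedHashMap<>();
        sheets.put("2014", List.of(Map.of("country", "US")));
        sheets.put("2015", List.of(Map.of("country", "DE")));
        System.out.println(export(sheets, Mode.MERGED, null));
        System.out.println(export(sheets, Mode.SEPARATE, null));
    }
}
```

Keeping the mode as an explicit parameter means the same extracted rows can feed any of the three outputs, so the extraction step does not need to know which option was chosen.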
Project ID: 9136203

About the project

11 proposals
Remote project
Active 8 years ago

11 freelancers are bidding an average of $562 USD for this job
Hi sir, I am a scraping expert and have done many similar projects; please check my feedback to see. Can you tell me more details? Then I will provide demo data for you. Thanks, Kimi
$250 USD in 5 days
5.0 (61 reviews)
6.1
A proposal has not yet been provided
$515 USD in 10 days
4.5 (32 reviews)
6.2
Hi, I have gone through the details, understood your objective and plan, and checked the data schema. I can deliver this in quick incremental iterations. I am a software architect with hands-on skills and good expertise in integration. Please let me know if we can discuss more. Happy holidays. Regards, kD
$722 USD in 20 days
4.9 (26 reviews)
5.5
I am a Sun Certified Java Programmer with over 12 years of experience in designing and developing applications. My clientele includes tier-1 companies in the banking and insurance domain in the UK. I believe my extensive experience in delivering ETL projects using Apache libraries and the Spring Integration framework makes me the perfect match for this project. Let's have a chat and take this forward. I look forward to working with you. Thanks in advance.
$611 USD in 10 days
4.8 (9 reviews)
3.8
Hi there, I have strong reasons to believe that I'm suitable for the job:
1. I've been programming in Java for 8 years and I've worked with Apache POI many times. I know exactly how to deal with .xlsx and .xls files, both reading and writing (reading in your case).
2. We always use GitHub in our company and we do things properly (branching, pull requests, etc.).
3. I will write regression tests before implementing the details (TDD) and promise high test coverage (80%+).
4. I have done data migration from one data source to another many times, including from Excel files of course.
5. I'm a strong and active communicator and I try everything I can to avoid misunderstanding.
6. I've worked with RxJava and other stream-based solutions to completely avoid out-of-memory issues with large-scale data, so if your .xlsx file is big I can guarantee that it won't crash while transferring the data.
7. Just for clarification, I'm not someone who attracts clients with a low price. I'm new to Freelancer, but I'm an experienced developer with solid skills, and I value reputation and long-term relationships over one-off jobs. I believe you want someone to do this project properly and continue with the rest of the data sources instead of finding another person for each data source. That's a good place to get started.
8. I don't mind working during Christmas; it's so hot in Australia and I don't want to go anywhere.
Merry Christmas! Cheers, Daniel
$250 USD in 10 days
4.9 (4 reviews)
2.3
A proposal has not yet been provided
$555 USD in 10 days
5.0 (1 review)
1.6
I am a seasoned software professional who works on data transformation every day. In my day job I work on the WSO2 ESB, which involves a lot of transformational processing. I have worked with POI in the past to process Excel documents, etc. I look forward to getting this project done fairly quickly.
$777 USD in 10 days
0.0 (0 reviews)
0.0
A proposal has not yet been provided
$555 USD in 3 days
0.0 (0 reviews)
0.0
I understand the requirement and will be able to provide this in about a week's time. Initial confirmation on the data acceptance for the task 1 item would be needed to proceed further. I am in the US and can be reached at 732-875-9974
$666 USD in 10 days
0.0 (1 review)
0.0
A proposal has not yet been provided
$555 USD in 10 days
0.0 (0 reviews)
0.0
Thanks for the detailed specifications. I can deliver this. Please let me know if we can discuss further. Thanks.
$722 USD in 15 days
0.0 (0 reviews)
0.0

About the client

United States
5.0
55
Payment method verified
Member since Jan. 18, 2009
