Node.js database for pasting email data
£20-250 GBP
In progress
Posted almost 4 years ago
Paid on delivery
We need a database with a lightweight interface for pasting in spreadsheet data. It should check the pasted data for duplicates, where two records count as duplicates primarily when they share the same email address.
The headers for the pasted data are attached.
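To illustrate the email-based duplicate check, here is a minimal sketch in plain Node.js. It assumes the pasted spreadsheet data arrives as tab-separated text with a header row, and that one column is called "email". The real headers are in the attachment, so that column name and the helper names (parsePastedRows, normaliseEmail, splitDuplicates) are hypothetical.

```javascript
// Sketch only: assumes tab-separated paste data with a header row.
// The "email" column name is an assumption; the real headers are in the attachment.

// Turn pasted text into an array of objects keyed by header name.
function parsePastedRows(text) {
  const lines = text.split(/\r?\n/).filter((line) => line.trim() !== "");
  if (lines.length === 0) return [];
  const headers = lines[0].split("\t").map((h) => h.trim());
  return lines.slice(1).map((line) => {
    const cells = line.split("\t");
    const row = {};
    headers.forEach((header, i) => {
      row[header] = (cells[i] ?? "").trim();
    });
    return row;
  });
}

// Compare emails case-insensitively and ignore surrounding whitespace.
function normaliseEmail(email) {
  return String(email).trim().toLowerCase();
}

// Split rows into new records and duplicates of an already seen email.
// knownEmails stands in for the (normalised) emails already stored in the database.
function splitDuplicates(rows, knownEmails = new Set(), emailField = "email") {
  const seen = new Set(knownEmails);
  const fresh = [];
  const duplicates = [];
  for (const row of rows) {
    const key = normaliseEmail(row[emailField] ?? "");
    if (key !== "" && seen.has(key)) {
      duplicates.push(row);
    } else {
      // Rows without an email cannot be matched, so they are kept as new.
      if (key !== "") seen.add(key);
      fresh.push(row);
    }
  }
  return { fresh, duplicates };
}
```

For example, splitDuplicates(parsePastedRows(pastedText), emailsAlreadyInDb) returns the rows that are safe to insert plus the rows whose email was already seen, either earlier in the same paste or in the database.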
- We need a DB to hold textual semi-structured data
- As such, a relational DB (like MySQL) may not be the best fit
- Instead we need something like MongoDB
- Users will have access to a website where they can paste JSON strings of data
- The website's backend will then run a distance-function search comparing that string against what is already in the DB (check out NLP textual distance functions)
- If the backend process finds an exact match, that string is discarded
- If the backend finds a partial match, then that string is added to the DB, but we also add a new tag in the JSON which will say "potential dupe" and "dupe probability = XX", where XX is the similarity we got from the distance function
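The flow in the list above could be sketched roughly as follows. This is only one possible reading: it uses Levenshtein edit distance as the distance function, an in-memory array in place of the MongoDB collection, and a 0.8 similarity cut-off for a "partial match". All three are assumptions, as are the field names potentialDupe and dupeProbability.

```javascript
// Sketch only: Levenshtein distance is one of several possible text distance functions.
function levenshtein(a, b) {
  if (a === b) return 0;
  if (a.length === 0) return b.length;
  if (b.length === 0) return a.length;
  // prev[j] is the distance between the first i-1 chars of a and the first j chars of b.
  let prev = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr[j] = Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost);
    }
    prev = curr;
  }
  return prev[b.length];
}

// Scale the distance to a similarity in [0, 1]; 1 means identical strings.
function similarity(a, b) {
  const longest = Math.max(a.length, b.length);
  return longest === 0 ? 1 : 1 - levenshtein(a, b) / longest;
}

// store is a plain array standing in for the database collection (assumption).
// threshold is a hypothetical cut-off for what counts as a partial match.
function ingest(jsonString, store, threshold = 0.8) {
  const record = JSON.parse(jsonString);
  // Re-serialising removes whitespace differences (key order is not normalised).
  const canonical = JSON.stringify(record);
  let best = 0;
  for (const existing of store) {
    best = Math.max(best, similarity(canonical, existing.canonical));
    // Exact match: discard the pasted string.
    if (best === 1) return { action: "discarded", similarity: 1 };
  }
  const doc = { ...record };
  const flagged = best >= threshold;
  if (flagged) {
    // Partial match: store the record, tagged with the similarity found.
    doc.potentialDupe = true;
    doc.dupeProbability = Number(best.toFixed(2));
  }
  store.push({ canonical, doc });
  return { action: flagged ? "flagged" : "added", similarity: best };
}
```

Comparing every pasted string against every stored record is linear in the size of the store, so a real backend would probably narrow the candidates first (for example by email) before running the distance function.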
We're also looking for other Node.js developers to help with other ongoing projects, so if you do well on this project we'll definitely have more work. We look for people who are familiar with good software development practices and write nice, clean code :).