Create an audio forced alignment program

$250-750 USD

Geschlossen

Veröffentlicht

vor fast 8 Jahren

$250-750 USD

Bezahlt bei Lieferung

I need an audio forced alignment program that takes a UTF-8 text file and an audio file, analyzes them, and then aligns the audio to the text in the file. The output should be another text file with the start-time and end-time for each word in the input text file. The program should work in both Windows and Mac. It doesn't matter which programming language you use to accomplish this task, as long as the program executes forced alignment when launched. It doesn't need to have a GUI (the program is an external component to a larger program). Any third-party libraries used should NOT be copy-left. If you think you're up to the job, please apply.

Software Architecture

Projekt-ID: 11039550

Über das Projekt

4 Vorschläge

Remote Projekt

Aktiv vor 8 Jahren

Möchten Sie etwas Geld verdienen?

E-Mail-Adresse

Vorteile einer Ausschreibung auf Freelancer

Legen Sie Ihr Budget und Ihren Zeitrahmen fest

Für Ihre Arbeit bezahlt werden

Skizzieren Sie Ihren Vorschlag

Sie können sich kostenlos anmelden und auf Aufträge bieten

4 Freelancer bieten im Durchschnitt $403 USD für diesen Auftrag

@rsgray

Hi there, This sounds like an interesting project. It is also something that has been done before, so although I would intend to deliver you an original solution (i.e. not containing any copyleft code), there are concepts that can be borrowed from existing libraries such as jtrans, sailalign etc. I have past experience of audio processing projects including speech to text, and I am familiar with several third party libraries in this space. Although it will be non-trivial to provide a solution without using copyleft-licensed code, it will be straightforward to re-implement the required techniques in a self-contained piece of software. Since you want Windows and Mac compatibility I would intend to use a portable language, either Python or Java. Do you have a preference, since it needs to hook into your larger program? In what language will the audio / text be? You mentioned that the text file is utf-8. Will it contain characters from non-latin character sets (e.g. cyrillic, arabic etc.)? Also, what is the duration of the audio files, and is there a large number of files to process? Do you need to be able to perform alignment in real-time or faster? I have entered a low bid for this project as I am keen to increase my rating on this site, but I believe I will be able to provide as good a solution as the higher bidders! Thanks for considering my bid, I hope I can work with you! Rob

$250 USD in 10 Tagen