-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Word-precision timecodes for audio w/o lyrics in online databases, optionally providing manual transcript #6
Comments
Sample fileThis is a short audio sample file with 4 lines:
Audio Filesing-rap-read.mp4
LyricsLine by line lyrics
Remarks on each lyrics line — What it tests for
Test results of various AI lyrics detection apps
|
Croonify
sing-rap-read--croonfiy.mp4 |
Noraebang by Gaudio LabOverall verdict: Quite good at some positions. But at pauses or stretchings still fails miserably. Possibly only its trickery/estimation is better. Doubting that real full phonetical mapping takes place, as the failure with word pauses indicates. sing-rap-read--noraebang-by-gaudio-lab.mp4
|
Your software: Karaokenerds Lyrics Transcriber
sing-rap-read--karaokenerds-lyrics-transcriber.mp4
|
Open
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Are the following use cases supported?
Goal / Desired output: Lyrics or subtitle file with word-precision timecodes
Starting point(s)
The text was updated successfully, but these errors were encountered: