Speaker 1: Next, let me show you the iDVT setup in the home environment. In this part, we are going to introduce you to the system setup for the interactive digital violin tutor. We are going to do this in four steps. First, we show you the whole environment in the room; second, we introduce the whole hardware system; then comes the software system; and last, the video camera calibration. Okay. First, let me show you the home. This is a very ordinary office in the School of Computing, National University of Singapore. We put some curtains on the wall, but that is not really necessary. We can set up the system in any ordinary home environment. Okay. Then we introduce the hardware system. The interactive digital violin tutor comprises, first, a laptop PC for the processing or recording of our violin playing, then a microphone to record the audio part, and two normal video cameras to record the hand motion and the finger motion. Okay. So, later I will act as a beginning violin learner, and my partner, Huang Huang, will show you the software part. Finally, we will show you a showcase of the iDVT in use.
Speaker 2: This is the interface of our system. The first window is for reference piece display. You can right-click in this window and play a reference piece using the pop-up menu. The second window is for student piece display. You can right-click in this window and conduct student piece recording, transcription, display, and playback here. The third window is for video processing display. You can right-click in this window and conduct video processing, namely fingering analysis and hand tracking. To use the system, the first thing to do is to calibrate the two cameras. Click Options, Input Source, Live Recording. Click here to start video calibration. The views of the two cameras are shown in this window. The calibration of the cameras is really flexible. The only requirement is that the camera capturing the fingers should capture a bird's-eye view of the violin from neck to bridge. The camera capturing the hand should capture the movement of the right hand. After calibration, right-click here and choose start record audio-visual to begin recording. After live recording, we will have one audio file and two video files saved on the hard disk. Now we can do the transcription using these three files as inputs. For better demonstration, here we use audio and video files captured from professionals. Open student audio. Choose the recorded audio file. Now you have two choices, audio-only transcription and audio-video transcription. Let's click transcription, audio-only, and start with audio-only transcription first. Processing complete. The audio-only transcription is displayed in this window. We can open the reference piece to see the difference and find out whether the player played correctly. Now we can use video processing to improve the audio-only transcription. Choose transcribe audio-visual. Audio processing and video processing can run concurrently. But since we have already done audio processing, we just need to start video processing. Open finger track. Open hand track.
Hit play to start processing. The finger tracking result and hand tracking result are shown in this window. Video processing complete. Wait a few seconds for audio-visual fusion. Fusion complete. Now the display has been updated. If you switch between the two transcription modes, you can see the difference. If you think the difference seems minute, let's listen. Play the reference piece. Then the audio-only result. And the audio-visual result. Hear the difference? This is the showcase of the iDVT in use. By introducing the visual information, that is, finger and hand motion, the iDVT can provide more accurate feedback to violin learners.
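Editor's note: the demo does not say how the audio-visual fusion step works internally. The sketch below is only a hypothetical illustration of the general idea described above, that visual events (finger or bow-hand motion) can confirm or reject uncertain notes from an audio-only transcription. The `Note` type, the `fuse` function, the confidence score, and both thresholds are assumptions made for this example and are not part of the iDVT.

```python
from dataclasses import dataclass


@dataclass
class Note:
    onset: float       # note start time in seconds
    pitch: int         # MIDI note number
    confidence: float  # hypothetical audio-transcriber confidence, 0..1


def fuse(audio_notes, visual_onsets, tolerance=0.05, min_confidence=0.5):
    """Keep an audio note if it is confident on its own, or if a visual
    onset (e.g. a detected finger or bow change) lies within `tolerance`
    seconds of it. Low-confidence notes with no visual support are dropped."""
    fused = []
    for note in audio_notes:
        confirmed = any(abs(note.onset - t) <= tolerance for t in visual_onsets)
        if confirmed or note.confidence >= min_confidence:
            fused.append(note)
    return fused


# Toy data: three audio candidates, two visually detected onsets.
audio_notes = [
    Note(onset=0.00, pitch=69, confidence=0.9),  # confident and confirmed
    Note(onset=0.50, pitch=71, confidence=0.3),  # weak, but visually confirmed
    Note(onset=0.80, pitch=72, confidence=0.2),  # weak and unconfirmed
]
visual_onsets = [0.02, 0.52]

result = fuse(audio_notes, visual_onsets)
print([n.pitch for n in result])
```

In this toy run the third candidate is dropped because it is both low-confidence and unsupported by any visual onset, which mirrors the claim in the demo that visual information can correct an audio-only result.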
Speaker 1: Here are the references mentioned in this video demo. Thank you for your attention. Here is the iDVT demonstration, automatic violin transcription using audio-visual fusion.