ELAN lets you create time-aligned transcripts by linking text to exact moments in audio or video. For beginners, the fastest path is simple: create a project, import your media, add a transcription tier, mark short segments, and type your transcript one segment at a time.
This guide walks you through each step in plain language. You’ll also see screenshot-style instructions in text, common mistakes to avoid, and a short practice exercise so you can try ELAN with confidence.
Key takeaways
- Start with one media file and one transcription tier.
- Work in short segments to keep timing clean and easy to edit.
- Name tiers clearly before you add lots of annotations.
- Save early and often, and keep your media file in the same folder.
- Practice with a 30 to 60 second clip before starting a full project.
What ELAN is and when to use it
ELAN is a tool for making annotations that line up with time in audio or video. Many people use it for interviews, language data, classroom recordings, and any project where timing matters.
If you only need plain text, a basic text editor may be enough. If you need each line connected to a start and end time, ELAN is a better fit.
Why beginners choose ELAN
- You can connect each transcript line to an exact span of time.
- You can create several tiers, such as speaker, transcription, translation, or notes.
- You can review playback segment by segment instead of guessing timestamps.
ELAN is widely used in language documentation and annotation workflows from the Max Planck Institute’s ELAN project. If you are doing accessibility work for published video, you may also need captioning rules that differ from research transcription, such as the guidance in the W3C captions overview.
Before you start: set up your project the right way
A clean setup prevents many beginner problems. Before you open ELAN, make one folder for the project and place your media file inside it.
Keep file names short and simple. Avoid changing the media file name after you link it in ELAN.
What to prepare
- One audio or video file, such as MP3, WAV, MP4, or MOV.
- A project folder with a clear name.
- A rough idea of what tiers you need.
- Headphones, if your audio has background noise.
Simple folder structure
- Project folder
- Media file
- ELAN file
- Optional notes file
If you already have machine-generated text, you can compare it against your timed annotations later. For some workflows, teams start with automated transcription and then refine timing and wording inside ELAN.
Step-by-step ELAN tutorial for beginners
Step 1: Create a new ELAN project
Open ELAN and create a new project. You will usually see a top menu, a central workspace, and one or more panels for media and annotations.
- Go to File.
- Select New.
- Choose the option to create a project with a media file.
- Pick your audio or video file.
Screenshot-style view: Imagine the ELAN window with the menu bar at the top. In the center, a waveform or video area appears after you load the file, and the lower part of the screen is ready for annotation tiers.
Step 2: Save the project immediately
Save the ELAN file before you do anything else. This helps ELAN keep a stable link to your media and gives you a restore point if you make a mistake.
- Go to File.
- Select Save As.
- Save the file in the same project folder as your media.
Screenshot-style view: A save dialog opens. The project folder is selected, and the ELAN file name matches the media file name, such as interview-01.eaf.
Step 3: Create your first tier
A tier is a row where you place annotations. For your first project, create one basic transcription tier and keep the name clear.
- Open the Tier menu.
- Select Add New Tier.
- Name it something simple, like Transcription.
- Choose a linguistic type if ELAN asks for one.
- Leave advanced settings at their defaults if you are unsure.
Screenshot-style view: A small settings window appears. There is a field for Tier Name, a dropdown for Linguistic Type, and buttons to confirm or cancel.
Step 4: Add more tiers only if you need them
Beginners often create too many tiers too early. Start with one or two, then expand later.
Good optional tiers include:
- Speaker
- Translation
- Notes
- Non-speech sounds
If you will share the final text outside ELAN, a clean transcript matters as much as timing. Some teams use transcription proofreading services after the first pass to catch wording and formatting issues.
Step 5: Learn the main panels
Before you annotate, take a minute to see where things are. This saves a lot of frustration later.
- Media player area: plays audio or video.
- Timeline or waveform: shows time visually.
- Tier area: holds your annotation rows.
- Current selection area: shows the segment you are working on.
Screenshot-style view: At the top left, a play button and time counter. Across the middle, a waveform or timeline. At the bottom, one row labeled Transcription waiting for annotations.
Step 6: Make your first time selection
Time-aligned transcription starts with selecting a short piece of audio or video. For beginners, 2 to 6 seconds per segment works well.
- Play the media.
- Pause when you hear the start of a phrase.
- Click or drag on the timeline to mark the start and end time.
- Create an annotation for that selected span on the Transcription tier.
Screenshot-style view: A small highlighted block appears on the timeline. The same block now appears on the Transcription tier as a new empty annotation.
Step 7: Enter the transcript text
Once the segment exists, click inside the annotation and type what you hear. Keep the text short and faithful to the audio.
- Double-click the annotation block, or use the edit option.
- Type the words for that time span.
- Save your changes.
Screenshot-style view: A text entry box opens over or near the selected annotation. The segment time remains fixed while you type the transcript text.
Step 8: Continue segment by segment
Repeat the same pattern through the file: listen, select, annotate, type, and review. This is slower at first, but it creates much cleaner alignment.
- Keep segments short.
- Break at natural pauses when possible.
- Use speaker changes as segment boundaries.
- Replay difficult parts several times.
Step 9: Review timing and text together
After you finish a short section, play it back while watching the annotations. Check that each annotation starts and ends in the right place and that the text matches the speech.
- Adjust boundaries that cut off words.
- Split long annotations into smaller ones.
- Merge very short annotations if they belong together.
How to segment audio well as a beginner
Good segmentation makes the whole transcript easier to read and fix. Bad segmentation creates editing problems later.
Use these simple rules
- One idea or short phrase per segment.
- 2 to 6 seconds is a safe starting range.
- Start at the first clear sound of the phrase.
- End after the last word, not too early.
- Split when the speaker changes.
When to create shorter segments
- Fast speech
- Overlapping speakers
- Heavy accents
- Important pauses
- Detailed linguistic analysis
When longer segments may be fine
- Slow, clear speech
- Simple monologues
- Rough first-pass annotation
If your final goal is subtitles rather than a research transcript, you may later move from ELAN to dedicated subtitling services or caption workflows with different line-length and reading-speed rules.
Common beginner mistakes in ELAN and how to avoid them
1. Creating too many tiers too early
This makes the screen busy and slows down your decisions. Start with Transcription, then add only what you truly need.
2. Making segments too long
Long segments are harder to review and edit. Keep them short so you can fix timing without affecting a large section.
3. Forgetting to save often
Save after each short section. It is a simple habit that protects your work.
4. Moving or renaming the media file
If ELAN cannot find the media, your project becomes harder to reopen. Keep the media file in the same folder and avoid renaming it mid-project.
5. Typing before checking the selected time span
Some beginners type text into the wrong annotation or with the wrong boundaries. Confirm the selected segment first, then enter text.
6. Ignoring playback review
Timing that looks fine on screen can still feel wrong in playback. Always listen back to a few recent annotations before moving on.
7. Using inconsistent text rules
Decide early how you will handle fillers, false starts, laughter, or unclear speech. Consistent rules make the transcript easier to use later.
Simple practice exercise: create your first time-aligned transcript
Practice on a short clip before you start a real project. A 30 to 60 second file is enough.
Goal
Create one ELAN project with one Transcription tier and at least 5 time-aligned annotations.
Instructions
- Choose a short clip with one clear speaker.
- Create a new ELAN project and save it.
- Add one tier called Transcription.
- Listen once without typing.
- On the second pass, create 5 short segments.
- Enter the transcript text for each segment.
- Play the full clip and adjust any segment that starts late or ends early.
Self-check questions
- Does every annotation have text?
- Do the segment boundaries match the spoken phrases?
- Are any segments too long to review easily?
- Would another person understand your tier names?
A simple transcription rule set for practice
- Write words as spoken.
- Mark unclear words with a simple tag you can recognize later.
- Use one segment per short phrase.
- Keep your formatting consistent from start to finish.
Common questions
Is ELAN hard for beginners?
It can feel unfamiliar at first, but the basic workflow is simple. Start with one media file and one tier, and learn segmentation before anything advanced.
What is a tier in ELAN?
A tier is a row for annotations. You might use one tier for transcription, another for translation, and another for speaker notes.
How long should each segment be?
For beginners, 2 to 6 seconds is a useful range. Shorter segments are easier to review and retime.
Can I transcribe audio only, or do I need video?
You can use ELAN with audio or video. The main idea is the same: text linked to time.
Should I create separate tiers for each speaker?
Only if it helps your project. Many beginners do better with one transcription tier first and add speaker detail later if needed.
What if I cannot hear a word clearly?
Mark it with a consistent placeholder and move on. You can return during review with headphones or after checking context.
Can I use ELAN for subtitles or captions?
You can use it for timed text work, but subtitle and caption delivery often has extra formatting requirements. Check the needs of your final platform before you export or publish.
Final thoughts
The best way to learn ELAN is to keep the first project small. One file, one tier, short segments, and steady review will teach you the core workflow much faster than trying every feature at once.
If you need help turning audio or video into accurate text before or after your ELAN workflow, GoTranscript provides the right solutions, including professional transcription services.