Blog chevron right How-to Guides

From ELAN to Publication: Exporting Timecoded Excerpts (How-To)

Michael Gallagher
Michael Gallagher
Posted in Zoom Mar 19 · 22 Mar, 2026
From ELAN to Publication: Exporting Timecoded Excerpts (How-To)

To move from ELAN to a publishable quote, you need three things: the right segment, clear timecodes, and enough context for readers to understand it. This guide shows how to select and export timecoded excerpts (text and stills), anonymize sensitive details, and keep an audit trail that links every excerpt back to the original media file.

Primary keyword: exporting timecoded excerpts from ELAN.

Key takeaways

  • Decide what counts as an “excerpt” before you export, then capture both the text and the time span.
  • Export in a format your publishing workflow can use (table/CSV, tab-delimited, or copy-ready text), and keep speaker and tier labels consistent.
  • Anonymize early, but keep a private mapping so you can audit decisions later.
  • Maintain an audit trail that connects each excerpt to: source file, ELAN project, tier, and exact timecodes.

1) What “publishable” means for ELAN excerpts

A publishable excerpt is a short, readable segment with exact start and end timecodes and enough metadata to trace it back to the source. It should also protect participants by removing or masking identifying information when needed.

Before you export anything, decide what your publication needs, because the format drives how you label tiers, speakers, and context notes.

Minimum fields to capture every time

  • Source media ID: filename or a stable identifier you control.
  • ELAN file ID: .eaf name/version (or a project ID).
  • Tier name(s): the tier that holds the excerpt (and any linked tiers like translation).
  • Start time / end time: in a consistent format (e.g., hh:mm:ss.mmm).
  • Speaker label: as shown in your tiers (or mapped to anonymized IDs).
  • Excerpt text: orthography/transcription and (if relevant) translation/gloss.
  • Context note: one sentence that explains what is happening and why it matters.

Decide your excerpt unit

  • Single annotation: one ELAN annotation as one excerpt (fast, clean, but may be too short).
  • Multi-annotation span: combine consecutive annotations into one excerpt (better for readability).
  • Scene-based: define a time window (e.g., 00:12:03.200–00:12:41.900) and pull all tiers in that window.

2) Preparing your ELAN project for clean exports

Exports go smoother when your tiers follow a predictable structure. Fix naming and tier alignment before you start selecting excerpts, because it prevents copy errors later.

A small cleanup pass also helps you build an audit trail that does not fall apart when files move or versions change.

Standardize tier and speaker labels

  • Use stable, readable tier names (e.g., spk_A_transcript, spk_A_translation).
  • Keep speaker labels consistent across files (don’t mix “S1,” “Speaker1,” and a real name).
  • If you plan to anonymize, switch to neutral IDs now (e.g., P01, P02) and store the mapping privately.

Check time alignment and gaps

  • Confirm your excerpt tiers align to the media timeline you will publish against.
  • Watch for overlapping annotations that could confuse a table export.
  • Decide how you will handle pauses, false starts, and interruptions for publication readability.

Create a “context” tier (recommended)

Add a tier specifically for publication notes, such as pub_context. Keep it short, factual, and readable, because it will travel with your excerpt into your manuscript or CMS.

Use it to explain references, off-screen actions, or why the excerpt matters, without adding interpretation you cannot support.

3) Selecting the right segments in ELAN (step-by-step)

The selection step is where most publication mistakes start, because it is easy to grab the right words but the wrong boundaries. Aim to select excerpts that are complete thoughts and still faithful to the original timing.

Keep your selection method consistent so your exports behave the same way across files.

A practical selection checklist

  • Start a little early: include the first word, not the breath before it.
  • End after the idea finishes: avoid chopping off a clause that changes meaning.
  • Include turn-taking cues: if another speaker interrupts, decide whether that belongs in the excerpt.
  • Keep a clean time span: ensure your excerpt window matches what you will cite.

When to combine annotations

  • Combine when the meaning depends on the next line (e.g., a question and answer).
  • Combine when you need a full sentence for readability.
  • Don’t combine when it mixes different topics or different speakers without a clear reason.

Capture context while you select

  • Write a 1–2 sentence note in your pub_context tier or a project log.
  • Record what the reader cannot see in the text (gesture, object handling, room noise, laughter).
  • Note any uncertainties (unclear word, overlapping speech) so you can address them before publishing.

4) Exporting timecoded excerpts: formats that work in publishing

You can export ELAN annotations into table-like formats that editors, researchers, and producers can use. The goal is to preserve timecodes and tier structure while making the excerpt easy to paste into a manuscript, report, or script.

Choose an export format based on who will use it next and what tool they use (Word, Google Docs, Excel, a CMS, or a qualitative coding tool).

Option A: Export a table (CSV / tab-delimited) for editing and review

  • Best for: sorting, filtering, reviewer comments, and building an excerpt library.
  • Include columns for: file ID, tier, speaker, start time, end time, text, translation, context note, anonymization status.
  • Tip: keep one excerpt per row, even if you include multiple tiers as separate columns.

Option B: Export copy-ready text blocks for manuscripts

  • Best for: quickly inserting quotes with citations into a draft.
  • Structure idea: Speaker ID + time range on one line, text on the next, and a short context note in brackets.
  • Tip: keep timecodes visible even if you later hide them for print, because they support verification.

Option C: Export time spans for audio/video clipping

If your publication includes embedded media, you may also want a simple “clip list.” A clip list is just a set of time ranges and labels that someone can use to cut audio/video in an editor.

  • Best for: documentary clips, podcast quotes, web articles with embedded players.
  • Include: source file, in/out time, excerpt label, speaker, and any bleep/redaction instructions.

Screenshots and stills (when you need a visual excerpt)

Sometimes you need a screenshot of the annotation view or a frame from the media to document context. Keep screenshots as supporting material, not as your main “source of truth,” because text exports are easier to audit.

  • Name files consistently: sourceID_start-end_tier.png.
  • Record why you captured it: gesture, object, environment, or on-screen text.
  • Check permissions: only publish visuals if your consent and publication plan allow it.

5) Anonymization: protect people without breaking traceability

Anonymization is not only about removing names. It also includes places, workplaces, unique roles, faces, and any combination of details that could identify someone.

Do the anonymization in a way that keeps your excerpt verifiable for your team, even if the published version hides sensitive details.

What to anonymize (common categories)

  • Direct identifiers: names, phone numbers, addresses, email, ID numbers.
  • Indirect identifiers: job titles, rare conditions, unique events, specific locations.
  • Third parties: people mentioned who did not consent.
  • Audio identifiers: a person’s voice may still identify them in a clip.

Practical anonymization techniques for excerpts

  • Replace with brackets: “I work at [HOSPITAL]” or “We met in [CITY].”
  • Generalize carefully: “cardiologist” → “doctor” if the specialty increases risk.
  • Use participant codes: “Maria” → “P03” (keep the mapping private).
  • Redact partial strings: “john.smith@…” → “[EMAIL].”

Keep two versions on purpose

  • Internal verified excerpt: original words + timecodes + full context, stored securely.
  • Publishable excerpt: anonymized text + timecodes (or citation format), plus a short public-safe context note.

Legal and ethics note (keep it simple)

If you work with personal data, follow your organization’s policies and any applicable rules for consent and privacy. If you are in a regulated environment, confirm whether your workflow needs extra safeguards (storage, access controls, retention).

For background on handling personal data, see the GDPR overview if it applies to your work.

6) Building an audit trail: link every excerpt back to the source

An audit trail lets you prove where an excerpt came from and how it changed from raw data to published text. It also helps your team fix problems fast when an editor asks, “Where is this quote in the recording?”

You do not need a complicated system, but you do need consistency.

What to include in an excerpt audit record

  • Excerpt ID: a unique label like EX-00041.
  • Source media: filename + checksum or version label (if you use one).
  • ELAN file: .eaf filename + last modified date (or version number).
  • Tier path: tier name(s) used (transcript/translation/context).
  • Time span: start/end timecode, plus time format used.
  • Text: internal version and publishable version (if different).
  • Anonymization actions: what you changed and why (brief, factual).
  • Reviewer/approver: initials and date (if your process requires it).

A simple file naming convention that holds up

  • Media: PROJECT_S001_2026-03-22.wav
  • ELAN: PROJECT_S001_2026-03-22.eaf
  • Excerpt export: PROJECT_S001_excerpts_v01.csv
  • Screenshot/still: PROJECT_S001_00-12-03-200_00-12-41-900.png

Linking excerpts back to the source file (two practical methods)

  • Method 1: “Source pointer” column
    • Create a column called source_pointer with: media ID + start/end + tier.
    • Example: PROJECT_S001.wav|00:12:03.200–00:12:41.900|spk_P01_transcript.
  • Method 2: Stable excerpt IDs across tools
    • Assign an EX-xxxxx ID and keep it in ELAN (context tier) and in your export.
    • Use the same ID in your manuscript comments or footnotes during drafting.

Pitfalls that break audit trails

  • Renaming media files after exporting without updating the excerpt library.
  • Changing tier names mid-project so exports no longer match older versions.
  • Copying quotes into drafts without timecodes or excerpt IDs.
  • Editing text for readability without documenting what changed (especially deletions).

Common questions

  • Do I need to keep timecodes in the final publication?
    Often you do not, but you should keep them in your internal draft and notes so you can verify quotes and respond to edits.
  • How long should a publishable excerpt be?
    Long enough to preserve meaning, short enough to stay focused. Many teams aim for one idea per excerpt, then adjust during editing.
  • Should I export translations and glosses with the excerpt?
    Yes if your readers need them, and also if your editor will fact-check. Put them in separate columns so they do not get mixed.
  • What if the audio is unclear in the excerpt window?
    Mark the uncertainty in a note, and consider a verification pass before publishing. Avoid guessing words in a quote.
  • How do I anonymize without losing research value?
    Use consistent placeholders and keep a private mapping and rationale so you can audit and, if needed, re-check details under the right permissions.
  • What’s the best way to share excerpts with collaborators?
    A CSV or spreadsheet with stable excerpt IDs, timecodes, and context works well. Share only the anonymized version if privacy requires it.
  • Can I turn ELAN excerpts into captions or subtitles?
    Sometimes, but captioning has strict formatting rules and timing requirements. If you need publish-ready captions, consider a dedicated caption workflow.

When to bring in transcription or caption support

ELAN is powerful for annotation, but publication workflows often require a clean transcript, consistent style, and careful proofreading. If you need help turning spoken audio into clear, readable text, a transcription workflow can save time and reduce errors.

If you plan to publish video with on-screen text, you may also want dedicated captioning or subtitling formats instead of excerpt tables.

If you want a dependable path from raw recordings to publishable, timecoded text, GoTranscript offers the right solutions, including professional transcription services.