SlideShare a Scribd company logo
1 of 16
Download to read offline
Transcript Alignment Service
Webinar
March 12, 2013
Moderator: Josh Miller
Speakers: Roger Zimmerman
David Zylber
Agenda
• Automatic Alignment vs. Transcription
and Captioning
• Alignment Service Overview
• Best Practices
• Submitting Transcripts & Media Files
• Formatting your Transcripts
• Q&A
Transcript Alignment Service vs.
Transcription and Captioning
• Use the Alignment service when you already
have a transcript
• Both services ultimately give you access to the
same 3Play Media account features and tools.
• Alignment is 100% automated where as the
standard service involves human clean up.
• Turnaround Service Levels
Automatic Alignment Process

1)Re-encode text as ASCII
•

MS-Word exports still contain non-ASCII characters

•

Direct upload users can see the results
DEMO
FTP Overview
• Create a folder named for_alignment

• Add the media file first to the for_alignment folder
- e.g. Casablanco.mp4

• Then add the plain .TXT transcript to the for_alignment folder
- e.g. Casablanco.txt

• The .TXT file MUST HAVE THE SAME NAME as the media
file

• Batch uploads: first submit all media files and then the
corresponding transcripts.
Alignment Best Practices
• THE KEY: Text corresponds to audio!
• Common Problems:
-Non-conforming speaker labels (not all caps, hyphens instead of colons
-Wrapped text becomes paragraphs
-Including instructions, screen directions, scene settings/headers
-Interpretation
-Overlapping speakers
-Audio quality
• Duration: No more than 2 hours per file
• Drag and Drop your transcripts when you can
• Transcripts should be unformatted plain text file (.TXT)
• Short duration reduces the likelihood of misalignment
DEMO
Automatic Alignment Process
continued…
2) Infer verbalization from text
•

Speaker labels used for adaptation (and replaced with
optional pause)

•

Punctuation removed (sentences replaced with pause)

•

Numerics expanded:
 10/10/2013 => “ten ten thirteen” OR “October tenth” …
 107 => “one hundred and seven” OR “one oh seven” …
 5’3” => “five foot three” or “five three” …

•

Acronyms/abbreviations expanded: “St.”, “ABC”, “NASDAQ”
Automatic Alignment Process
3) Build a “biased” language model (with options):
CEO: “On 10/10/2013, we will be listed on NASDAQ as ABC”
<SPEAKER> on { NULL / this } { ten ten / october tenth }
<COMMA> { NULL / twenty thirteen / thirteen } { we will / we’ll
} be listed on the nasdaq as a b c <SENTENCE> …
Automatic Alignment Process
4) Run ASR with biased LM:
ON
OCTOBER
TENTH
WE’LL
BE

1.02 1.05
1.05 1.32
1.32 1.51
1.63 1.76
1.76 1.82
Automatic Alignment Process
5) Re-Align with original text:






ON
OCTOBER
TENTH
WE’LL




BE

CEO:
On

0.0
1.02

1.02
1.05

10/10/2013,
we
will
be

1.05
1.63
1.695
1.76

1.51
1.695
1.76
1.82
Automatic Alignment Process

6) Fill in gaps in ASR output with all of the original transcript
text in that region.
Automatic Alignment Process
7) Compute confidence from ASR process plus
number/length of gaps.
 “Audio Quality” bars
Automatic Alignment Process

8) Create all output assets from the aligned transcript, as if it
had been edited.
NEED HELP?
RESOURCES

Knowledge Base

support.3playmedia.com/forums
Contact 3Play Media Support

support@3playmedia.com

More Related Content

Viewers also liked

Pennsylvania State of Higher Education (PASSHE) Virtual Conference
 Pennsylvania State of Higher Education (PASSHE) Virtual Conference Pennsylvania State of Higher Education (PASSHE) Virtual Conference
Pennsylvania State of Higher Education (PASSHE) Virtual Conference3Play Media
 
AAPF Grantmakers in Education Presentation
AAPF Grantmakers in Education PresentationAAPF Grantmakers in Education Presentation
AAPF Grantmakers in Education Presentationaapfslides
 
Derivativemarketinnepal 130710115026-phpapp02
Derivativemarketinnepal 130710115026-phpapp02Derivativemarketinnepal 130710115026-phpapp02
Derivativemarketinnepal 130710115026-phpapp02adnanabbas
 
Secion educativa sobre el pandillaje
Secion educativa  sobre el pandillajeSecion educativa  sobre el pandillaje
Secion educativa sobre el pandillaje990672000
 
Accessibility at Blackboard
Accessibility at BlackboardAccessibility at Blackboard
Accessibility at Blackboard3Play Media
 

Viewers also liked (6)

Pennsylvania State of Higher Education (PASSHE) Virtual Conference
 Pennsylvania State of Higher Education (PASSHE) Virtual Conference Pennsylvania State of Higher Education (PASSHE) Virtual Conference
Pennsylvania State of Higher Education (PASSHE) Virtual Conference
 
Didactiko 1
Didactiko 1Didactiko 1
Didactiko 1
 
AAPF Grantmakers in Education Presentation
AAPF Grantmakers in Education PresentationAAPF Grantmakers in Education Presentation
AAPF Grantmakers in Education Presentation
 
Derivativemarketinnepal 130710115026-phpapp02
Derivativemarketinnepal 130710115026-phpapp02Derivativemarketinnepal 130710115026-phpapp02
Derivativemarketinnepal 130710115026-phpapp02
 
Secion educativa sobre el pandillaje
Secion educativa  sobre el pandillajeSecion educativa  sobre el pandillaje
Secion educativa sobre el pandillaje
 
Accessibility at Blackboard
Accessibility at BlackboardAccessibility at Blackboard
Accessibility at Blackboard
 

Similar to Align Transcripts to Audio Webinar

IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server
IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server
IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server Antonios Chatzipavlis
 
La big datacamp-2014-aws-dynamodb-overview-michael_limcaco
La big datacamp-2014-aws-dynamodb-overview-michael_limcacoLa big datacamp-2014-aws-dynamodb-overview-michael_limcaco
La big datacamp-2014-aws-dynamodb-overview-michael_limcacoData Con LA
 
What is machine translation
What is machine translationWhat is machine translation
What is machine translationStephen Peacock
 
A Brief Intro to Adobe Flex
A Brief Intro to Adobe FlexA Brief Intro to Adobe Flex
A Brief Intro to Adobe FlexChad Udell
 
COE 2017: Your first 3DEXPERIENCE customization
COE 2017: Your first 3DEXPERIENCE customizationCOE 2017: Your first 3DEXPERIENCE customization
COE 2017: Your first 3DEXPERIENCE customizationRazorleaf Corporation
 
Unit 5 application layer
Unit 5 application layerUnit 5 application layer
Unit 5 application layerKritika Purohit
 
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFSSimple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFSAmazon Web Services
 
Deep Dive on Elastic File System - February 2017 AWS Online Tech Talks
Deep Dive on Elastic File System - February 2017 AWS Online Tech TalksDeep Dive on Elastic File System - February 2017 AWS Online Tech Talks
Deep Dive on Elastic File System - February 2017 AWS Online Tech TalksAmazon Web Services
 
Programming Languages #devcon2013
Programming Languages #devcon2013Programming Languages #devcon2013
Programming Languages #devcon2013Iván Montes
 
3-Application Layer.pptx
3-Application Layer.pptx3-Application Layer.pptx
3-Application Layer.pptxSachinDUpadhye
 
Simple, Scalable and Highly Durable NAS in the Cloud - Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud - Amazon EFSSimple, Scalable and Highly Durable NAS in the Cloud - Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud - Amazon EFSAmazon Web Services
 

Similar to Align Transcripts to Audio Webinar (20)

Build your own ASR engine
Build your own ASR engineBuild your own ASR engine
Build your own ASR engine
 
IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server
IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server
IT PRO|DEV CONNECTIONS 2013 - The X-Files of SQL Server
 
La big datacamp-2014-aws-dynamodb-overview-michael_limcaco
La big datacamp-2014-aws-dynamodb-overview-michael_limcacoLa big datacamp-2014-aws-dynamodb-overview-michael_limcaco
La big datacamp-2014-aws-dynamodb-overview-michael_limcaco
 
Deep Learning Summit (DLS01-4)
Deep Learning Summit (DLS01-4)Deep Learning Summit (DLS01-4)
Deep Learning Summit (DLS01-4)
 
What is machine translation
What is machine translationWhat is machine translation
What is machine translation
 
A Brief Intro to Adobe Flex
A Brief Intro to Adobe FlexA Brief Intro to Adobe Flex
A Brief Intro to Adobe Flex
 
COE 2017: Your first 3DEXPERIENCE customization
COE 2017: Your first 3DEXPERIENCE customizationCOE 2017: Your first 3DEXPERIENCE customization
COE 2017: Your first 3DEXPERIENCE customization
 
Unit 5 application layer
Unit 5 application layerUnit 5 application layer
Unit 5 application layer
 
Ms DOS
Ms DOSMs DOS
Ms DOS
 
15. text files
15. text files15. text files
15. text files
 
Moses
MosesMoses
Moses
 
Introduction
IntroductionIntroduction
Introduction
 
Introduction to .Net
Introduction to .NetIntroduction to .Net
Introduction to .Net
 
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFSSimple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud – Amazon EFS
 
Tool
ToolTool
Tool
 
Deep Dive on Elastic File System - February 2017 AWS Online Tech Talks
Deep Dive on Elastic File System - February 2017 AWS Online Tech TalksDeep Dive on Elastic File System - February 2017 AWS Online Tech Talks
Deep Dive on Elastic File System - February 2017 AWS Online Tech Talks
 
Programming Languages #devcon2013
Programming Languages #devcon2013Programming Languages #devcon2013
Programming Languages #devcon2013
 
3-Application Layer.pptx
3-Application Layer.pptx3-Application Layer.pptx
3-Application Layer.pptx
 
Simple, Scalable and Highly Durable NAS in the Cloud - Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud - Amazon EFSSimple, Scalable and Highly Durable NAS in the Cloud - Amazon EFS
Simple, Scalable and Highly Durable NAS in the Cloud - Amazon EFS
 
Stupid Video Tricks
Stupid Video TricksStupid Video Tricks
Stupid Video Tricks
 

More from 3Play Media

Advancing Equity and Inclusion for Deaf Students in Higher Education
Advancing Equity and Inclusion for Deaf Students in Higher EducationAdvancing Equity and Inclusion for Deaf Students in Higher Education
Advancing Equity and Inclusion for Deaf Students in Higher Education3Play Media
 
"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity
"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity
"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity3Play Media
 
The 3Play Way: Real-Time Captioning in Higher Education
The 3Play Way: Real-Time Captioning in Higher EducationThe 3Play Way: Real-Time Captioning in Higher Education
The 3Play Way: Real-Time Captioning in Higher Education3Play Media
 
Developing a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State UniversityDeveloping a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State University3Play Media
 
Developing a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State UniversityDeveloping a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State University3Play Media
 
Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...
Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...
Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...3Play Media
 
Product Innovation is on the Edge
Product Innovation is on the EdgeProduct Innovation is on the Edge
Product Innovation is on the Edge3Play Media
 
Why Every Company Needs to Think and Act Like a Media Company
Why Every Company Needs to Think and Act Like a Media CompanyWhy Every Company Needs to Think and Act Like a Media Company
Why Every Company Needs to Think and Act Like a Media Company3Play Media
 
2023 State of Automatic Speech Recognition
2023 State of Automatic Speech Recognition2023 State of Automatic Speech Recognition
2023 State of Automatic Speech Recognition3Play Media
 
Complex Identities: The Intersection of Disability with Race, Culture, Gender...
Complex Identities: The Intersection of Disability with Race, Culture, Gender...Complex Identities: The Intersection of Disability with Race, Culture, Gender...
Complex Identities: The Intersection of Disability with Race, Culture, Gender...3Play Media
 
Accessibility as a Gateway to Creativity
Accessibility as a Gateway to CreativityAccessibility as a Gateway to Creativity
Accessibility as a Gateway to Creativity3Play Media
 
Disability Inclusion for Leadership
Disability Inclusion for LeadershipDisability Inclusion for Leadership
Disability Inclusion for Leadership3Play Media
 
How to Tell Whether UDL is Working
How to Tell Whether UDL is WorkingHow to Tell Whether UDL is Working
How to Tell Whether UDL is Working3Play Media
 
Neurodivergency at work (P2) – 3Play and B-I.pdf
Neurodivergency at work (P2) – 3Play and B-I.pdfNeurodivergency at work (P2) – 3Play and B-I.pdf
Neurodivergency at work (P2) – 3Play and B-I.pdf3Play Media
 
Neurodiversity in the Workplace - Part 1
Neurodiversity in the Workplace - Part 1Neurodiversity in the Workplace - Part 1
Neurodiversity in the Workplace - Part 13Play Media
 
How To Deliver an Accessible Online Presentation
How To Deliver an Accessible Online PresentationHow To Deliver an Accessible Online Presentation
How To Deliver an Accessible Online Presentation3Play Media
 
Power of an Accessible Website.pdf
Power of an Accessible Website.pdfPower of an Accessible Website.pdf
Power of an Accessible Website.pdf3Play Media
 
2022 Digital Accessibility Legal Update.pdf
2022 Digital Accessibility Legal Update.pdf2022 Digital Accessibility Legal Update.pdf
2022 Digital Accessibility Legal Update.pdf3Play Media
 
Intro to Live Captioning for Broadcast.pdf
Intro to Live Captioning for Broadcast.pdfIntro to Live Captioning for Broadcast.pdf
Intro to Live Captioning for Broadcast.pdf3Play Media
 
How to Scale a Sustainable Accessibility Program
How to Scale a Sustainable Accessibility Program How to Scale a Sustainable Accessibility Program
How to Scale a Sustainable Accessibility Program 3Play Media
 

More from 3Play Media (20)

Advancing Equity and Inclusion for Deaf Students in Higher Education
Advancing Equity and Inclusion for Deaf Students in Higher EducationAdvancing Equity and Inclusion for Deaf Students in Higher Education
Advancing Equity and Inclusion for Deaf Students in Higher Education
 
"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity
"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity
"Am I Doing This Right?" Imposter Syndrome and Accessibility Maturity
 
The 3Play Way: Real-Time Captioning in Higher Education
The 3Play Way: Real-Time Captioning in Higher EducationThe 3Play Way: Real-Time Captioning in Higher Education
The 3Play Way: Real-Time Captioning in Higher Education
 
Developing a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State UniversityDeveloping a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State University
 
Developing a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State UniversityDeveloping a Centrally Supported Captioning System with Utah State University
Developing a Centrally Supported Captioning System with Utah State University
 
Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...
Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...
Lessons Learned: Canada’s Past, Present, and Future Leadership in Digital Acc...
 
Product Innovation is on the Edge
Product Innovation is on the EdgeProduct Innovation is on the Edge
Product Innovation is on the Edge
 
Why Every Company Needs to Think and Act Like a Media Company
Why Every Company Needs to Think and Act Like a Media CompanyWhy Every Company Needs to Think and Act Like a Media Company
Why Every Company Needs to Think and Act Like a Media Company
 
2023 State of Automatic Speech Recognition
2023 State of Automatic Speech Recognition2023 State of Automatic Speech Recognition
2023 State of Automatic Speech Recognition
 
Complex Identities: The Intersection of Disability with Race, Culture, Gender...
Complex Identities: The Intersection of Disability with Race, Culture, Gender...Complex Identities: The Intersection of Disability with Race, Culture, Gender...
Complex Identities: The Intersection of Disability with Race, Culture, Gender...
 
Accessibility as a Gateway to Creativity
Accessibility as a Gateway to CreativityAccessibility as a Gateway to Creativity
Accessibility as a Gateway to Creativity
 
Disability Inclusion for Leadership
Disability Inclusion for LeadershipDisability Inclusion for Leadership
Disability Inclusion for Leadership
 
How to Tell Whether UDL is Working
How to Tell Whether UDL is WorkingHow to Tell Whether UDL is Working
How to Tell Whether UDL is Working
 
Neurodivergency at work (P2) – 3Play and B-I.pdf
Neurodivergency at work (P2) – 3Play and B-I.pdfNeurodivergency at work (P2) – 3Play and B-I.pdf
Neurodivergency at work (P2) – 3Play and B-I.pdf
 
Neurodiversity in the Workplace - Part 1
Neurodiversity in the Workplace - Part 1Neurodiversity in the Workplace - Part 1
Neurodiversity in the Workplace - Part 1
 
How To Deliver an Accessible Online Presentation
How To Deliver an Accessible Online PresentationHow To Deliver an Accessible Online Presentation
How To Deliver an Accessible Online Presentation
 
Power of an Accessible Website.pdf
Power of an Accessible Website.pdfPower of an Accessible Website.pdf
Power of an Accessible Website.pdf
 
2022 Digital Accessibility Legal Update.pdf
2022 Digital Accessibility Legal Update.pdf2022 Digital Accessibility Legal Update.pdf
2022 Digital Accessibility Legal Update.pdf
 
Intro to Live Captioning for Broadcast.pdf
Intro to Live Captioning for Broadcast.pdfIntro to Live Captioning for Broadcast.pdf
Intro to Live Captioning for Broadcast.pdf
 
How to Scale a Sustainable Accessibility Program
How to Scale a Sustainable Accessibility Program How to Scale a Sustainable Accessibility Program
How to Scale a Sustainable Accessibility Program
 

Recently uploaded

How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Recently uploaded (20)

How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Align Transcripts to Audio Webinar

  • 1. Transcript Alignment Service Webinar March 12, 2013 Moderator: Josh Miller Speakers: Roger Zimmerman David Zylber
  • 2. Agenda • Automatic Alignment vs. Transcription and Captioning • Alignment Service Overview • Best Practices • Submitting Transcripts & Media Files • Formatting your Transcripts • Q&A
  • 3. Transcript Alignment Service vs. Transcription and Captioning • Use the Alignment service when you already have a transcript • Both services ultimately give you access to the same 3Play Media account features and tools. • Alignment is 100% automated where as the standard service involves human clean up. • Turnaround Service Levels
  • 4. Automatic Alignment Process 1)Re-encode text as ASCII • MS-Word exports still contain non-ASCII characters • Direct upload users can see the results
  • 6. FTP Overview • Create a folder named for_alignment • Add the media file first to the for_alignment folder - e.g. Casablanco.mp4 • Then add the plain .TXT transcript to the for_alignment folder - e.g. Casablanco.txt • The .TXT file MUST HAVE THE SAME NAME as the media file • Batch uploads: first submit all media files and then the corresponding transcripts.
  • 7. Alignment Best Practices • THE KEY: Text corresponds to audio! • Common Problems: -Non-conforming speaker labels (not all caps, hyphens instead of colons -Wrapped text becomes paragraphs -Including instructions, screen directions, scene settings/headers -Interpretation -Overlapping speakers -Audio quality • Duration: No more than 2 hours per file • Drag and Drop your transcripts when you can • Transcripts should be unformatted plain text file (.TXT) • Short duration reduces the likelihood of misalignment
  • 9. Automatic Alignment Process continued… 2) Infer verbalization from text • Speaker labels used for adaptation (and replaced with optional pause) • Punctuation removed (sentences replaced with pause) • Numerics expanded:  10/10/2013 => “ten ten thirteen” OR “October tenth” …  107 => “one hundred and seven” OR “one oh seven” …  5’3” => “five foot three” or “five three” … • Acronyms/abbreviations expanded: “St.”, “ABC”, “NASDAQ”
  • 10. Automatic Alignment Process 3) Build a “biased” language model (with options): CEO: “On 10/10/2013, we will be listed on NASDAQ as ABC” <SPEAKER> on { NULL / this } { ten ten / october tenth } <COMMA> { NULL / twenty thirteen / thirteen } { we will / we’ll } be listed on the nasdaq as a b c <SENTENCE> …
  • 11. Automatic Alignment Process 4) Run ASR with biased LM: ON OCTOBER TENTH WE’LL BE 1.02 1.05 1.05 1.32 1.32 1.51 1.63 1.76 1.76 1.82
  • 12. Automatic Alignment Process 5) Re-Align with original text:      ON OCTOBER TENTH WE’LL   BE CEO: On 0.0 1.02 1.02 1.05 10/10/2013, we will be 1.05 1.63 1.695 1.76 1.51 1.695 1.76 1.82
  • 13. Automatic Alignment Process 6) Fill in gaps in ASR output with all of the original transcript text in that region.
  • 14. Automatic Alignment Process 7) Compute confidence from ASR process plus number/length of gaps.  “Audio Quality” bars
  • 15. Automatic Alignment Process 8) Create all output assets from the aligned transcript, as if it had been edited.