Marketplace/Spanish-English Bilingual Support Transcripts — 480K Conversations, Code-Switching Annotated
Call Centers

Spanish-English Bilingual Support Transcripts — 480K Conversations, Code-Switching Annotated

Transcribed customer support conversations where agents and callers switch between Spanish and English. Each segment is language-tagged at the sentence level with code-switching points annotated. Sourced from insurance and healthcare support lines. Critical for training multilingual NLP models and bilingual virtual agents.

Formats

JSONL transcriptsTSV aligned pairsParquet

Volume

480K conversations (~62K hours)

Time Range

2021-2026, 5 years

Refresh Rate

Quarterly

Compliance & Privacy

No PIIHIPAACCPA

Interested in this data?

Sign up to express interest, request a sample, or start a deal. Seller identity is revealed only after mutual interest.

Listed April 5, 2026