A Unified Multimodal Dataset for Brazilian Sign Language Tasks
⚠️ Important: Our dataset requires credentialized access. The datasets on HuggingFace are gated and access must be authorized after you agree to the terms and conditions. Please visit the dataset repositories and request access through HuggingFace.
Captar-Libras is a large-scale multimodal dataset for Brazilian Sign Language (Libras), comprising video sequences from native Deaf signers performing sentences in the medical domain. Data was collected in a controlled capture environment using four synchronized cameras: a frontal RGB-D camera, two wide-angle cameras covering lateral and zenithal views, and a dedicated high-definition facial camera. This enables rich spatial and temporal modeling of body, hand, and facial expressions.
Each recording follows a guided protocol to ensure linguistic consistency and is annotated with frame-level gloss temporal alignments produced by specialist annotators. The dataset includes raw multi-view recordings, preprocessed video segments, structured gloss and text annotations, and precomputed full-body pose sequences in SMPL-X format. A signer-independent train/validation/test split stratified by gender and skin tone is provided as the recommended evaluation protocol.
Video Sequences
Signers (50F / 28M)
Sentences
Camera Views
Structured gloss and text annotations for all sequences in the dataset.
Show Download SnippetHigh-resolution frontal RGB video streams from the frontal camera.
Show Download SnippetDepth sequences from the frontal RGB-D camera for 3D reconstruction.
Show Download SnippetWide-angle camera covering zenithal perspective (top-down view).
Show Download SnippetDedicated high-definition facial camera for fine-grained facial expression analysis.
Show Download SnippetPrecomputed full-body 3D pose sequences in SMPL-X format for all sequences.
Show Download SnippetPreprocessed video segments used in the baseline experiments.
Show Download SnippetIf you use Captar-Libras in your research, please cite it using the following BibTeX entry:
@inproceedings{magalhaes2026captarlibras,
title={Captar-Libras: A Unified Multimodal Dataset for Brazilian Sign Language Tasks},
author={Cauã Magalhães and Bruno Lages and Ari Filho and Jéssica Ramos and Paulo de Souza Coelho and Michel Silva and Thiago L. Gomes and Milena Soriano Marcolino and Elidéa Bernardino and Raquel O. Prates and Mario F. M. Campos and Erickson R. Nascimento},
year={2026},
booktitle={The Fortieth Annual Conference on Neural Information Processing Systems Evaluations and Datasets Track}
}