# Tadabur: A Large-Scale Quran Audio Dataset

- Source: HuggingFace Daily Papers (community trending papers)
- Published: 2026-04-21 08:00
- AIHOT link: https://aihot.virxact.com/items/cmob7jfls06f9sl1y7fqry91q
- Original paper: https://arxiv.org/abs/2604.18932

## AI Summary

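The research team has released Tadabur, a large-scale Quran audio dataset containing more than 1,400 hours of recitation audio from over 600 distinct reciters recorded under diverse conditions. The dataset varies considerably in recitation style and vocal characteristics, substantially expanding the scale and variability of existing Quranic speech data. It aims to provide a comprehensive resource for related research and to support the establishment of standardized benchmarks.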

## Main Text

Despite growing interest in Quranic data research, existing Quran datasets remain limited in both scale and diversity. To address this gap, we present Tadabur, a large-scale Quran audio dataset. Tadabur comprises more than 1,400 hours of recitation audio from over 600 distinct reciters, providing substantial variation in recitation styles, vocal characteristics, and recording conditions. This diversity makes Tadabur a comprehensive and representative resource for Quranic speech research and analysis. By significantly expanding both the total duration and variability of available Quran data, Tadabur aims to support future research and facilitate the development of standardized Quranic speech benchmarks.
