Designing a speech corpus for studies in experimental phonetics: Methodological decisions and challenges

Mousavi, Neda

doi:10.22034/nf.2026.577011.1518

Articles in Press

Article type: Research article

Author

Neda Mousavi

Department of Speech Science and Phonetics, Martin Luther University Halle-Wittenberg, and Max Planck Institute for Empirical Aesthetics, Frankfurt

Abstract

This article offers a methodological overview of the stages of designing and building a speech corpus, focusing on the decisions and challenges involved in collecting standardized speech data for experimental phonetics. The overview is based on experience gained from designing and implementing a speech corpus within the author's doctoral dissertation. The main corpus is bilingual, but this article focuses solely on its Persian part. The stages of corpus construction, including the selection and design of speech elicitation tasks, speaker sampling, the design of the recording environment and conditions, data organization and naming, and finally speech preprocessing and segmentation, are discussed systematically. It is shown that corpus design is not a purely operational process; rather, it comprises a set of deliberate methodological decisions that directly affect data quality, the interpretability of results, and the reproducibility of analyses. The article also addresses the challenges of using automatic methods to segment Persian speech corpora and highlights the limitations of automatic speech recognition tools in the context of low-resource languages. Finally, it argues that careful documentation of these decisions and stages is itself an essential part of corpus construction and plays a decisive role in methodological transparency, the evaluation of results, and the scientific usability of the data.

Keywords

Speech corpus

Speech elicitation

Speech segmentation

Automatic speech recognition

Low-resource languages

Subjects

Iranian languages and dialects

Article title (English)

Designing a speech corpus for studies in experimental phonetics: Methodological decisions and challenges

Author (English)

Neda Mousavi

Department of Speech Science and Phonetics, Martin Luther University Halle-Wittenberg, and Max Planck Institute for Empirical Aesthetics, Frankfurt am Main

Abstract (English)

This paper provides a methodological overview of the stages involved in designing and constructing a speech corpus, with particular emphasis on the decisions and challenges that arise when collecting standardized speech data for experimental phonetics. The discussion draws on the author’s experience developing a speech corpus as part of a doctoral dissertation. Although the original corpus is bilingual, containing Persian and German data, the present article focuses exclusively on the Persian component. The paper describes the key stages of corpus construction, including the design and selection of speech elicitation tasks, speaker sampling, recording environment and conditions, data organization and naming conventions, and procedures for speech preprocessing and segmentation. It argues that corpus construction is not merely a technical or operational task, but a sequence of deliberate methodological decisions that directly affect data quality, the interpretability of results, and the reproducibility of analyses. In addition, the paper discusses the challenges of applying automatic segmentation methods to Persian speech, highlighting the limitations of automatic speech recognition and romanization tools in low-resource language contexts. It further argues that systematically documenting methodological decisions is itself an essential component of corpus construction, as such documentation promotes transparency, enables critical evaluation of results, and enhances the long-term scientific usability of the data.

Keywords (English)

Speech corpus

Speech elicitation

Speech segmentation

Automatic speech recognition (ASR)

Low-resource languages