Welcome to Map Task Corpus of Heritage BCMS!

The Map Task Corpus of Heritage BCMS (Bosnian/Croatian/Montenegrin/Serbian) contains 30 short audio recordings of second-generation BCMS speakers who grew up in Switzerland. The corpus was designed and developed in the courses "Corpus Linguistics" (fall semester 2019) and "BCMS as a heritage language" (spring semester 2020) at the Department of Slavonic Languages and Literatures in Zurich.

The recorded conversations are Map Tasks: elicited conversations in which two speakers solve a collaborative task. In each case, a speaker explains to his/her counterpart how he/she should draw a line from point A to point B past various illustrations. When selecting the subjects, an attempt was made to recruit speakers from different (language) areas of former Yugoslavia. After the recordings, metadata about the speakers was collected. A selection of the metadata can be seen in the Metadata table.

The aim of the course project was to make an initial assessment of the knowledge of one's own and "foreign" BCMS varieties with the help of short but comparable conversations. The corpus of Map Task conversations represents a pilot study investigating the BCMS in Switzerland and serves to identify the relevant aspects of the linguistic variation as an inspiration for further research questions. For more information about the corpus please see our paper and the corpus repository on CLARIN.SI.

Lemmenmeier-Batinić, D., Batinić, J. & Escher, A. Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland. Lang Resources & Evaluation 57, 1607–1644 (2023). https://doi.org/10.1007/s10579-023-09634-7


Metadata

The data in the metadata table can be sorted, filtered and exported. The father's place of birth is marked on the folder as the speaker's place of origin. Other relevant places for each subject are marked in a separate map places.

For visualising the data we used Leaflet, OpenStreetMap and DataTables.


Uploads

In the Uploads page users can upload their custom annotations, share them with other users, and view shared custom annotations.

Transcripts

In the HTML transcripts (transcript example), the annotated data (code-switching, breaks, non-standard language words, etc.) are accessible. Users can also annotate the transcripts themselves according to popular categories and export the selection. A demo version of the XSLT that converts TEI files to HTML can be downloaded here. In addition, we provide audio files, transcripts in .xml (TEI), .fln, .flk, and textGrid format, as well as scans of solved map tasks.
Transkript (HTML)Audiodatei (WAV)XML Transkript (TEI)FOLKER transcript (FLK)Normalised FLK-transcript (FLN)Praat (textGrid)Map Task Bild
DL_01_BDL_01_B.wavDL_01_B.xmlDL_01_B.flkDL_01_B.flnDL_01_B.textGridDL_01_B.jpg
DM_01_BDM_01_B.wavDM_01_B.xmlDM_01_B.flkDM_01_B.flnDM_01_B.textGridDM_01_B.jpg
DM_01_ADM_01_A.wavDM_01_A.xmlDM_01_A.flkDM_01_A.flnDM_01_A.textGridDM_01_A.jpg
DL_01_ADL_01_A.wavDL_01_A.xmlDL_01_A.flkDL_01_A.flnDL_01_A.textGridDL_01_A.jpg
HK_01_AHK_01_A.wavHK_01_A.xmlHK_01_A.flkHK_01_A.flnHK_01_A.textGridHK_01_A.jpg
HK_01_BHK_01_B.wavHK_01_B.xmlHK_01_B.flkHK_01_B.flnHK_01_B.textGridHK_01_B.jpg
SB_01_BSB_01_B.wavSB_01_B.xmlSB_01_B.flkSB_01_B.flnSB_01_B.textGridSB_01_B.jpg
SB_01_ASB_01_A.wavSB_01_A.xmlSB_01_A.flkSB_01_A.flnSB_01_A.textGridSB_01_A.jpg
MB_01_AMB_01_A.wavMB_01_A.xmlMB_01_A.flkMB_01_A.flnMB_01_A.textGridMB_01_A.jpg
SV_01_BSV_01_B.wavSV_01_B.xmlSV_01_B.flkSV_01_B.flnSV_01_B.textGridSV_01_B.jpg
MB_01_BMB_01_B.wavMB_01_B.xmlMB_01_B.flkMB_01_B.flnMB_01_B.textGridMB_01_B.jpg
SV_01_ASV_01_A.wavSV_01_A.xmlSV_01_A.flkSV_01_A.flnSV_01_A.textGridSV_01_A.jpg
GK_03_AGK_03_A.wavGK_03_A.xmlGK_03_A.flkGK_03_A.flnGK_03_A.textGridGK_03_A.jpg
JA_01_BJA_01_B.wavJA_01_B.xmlJA_01_B.flkJA_01_B.flnJA_01_B.textGridJA_01_B.jpg
GK_01_BGK_01_B.wavGK_01_B.xmlGK_01_B.flkGK_01_B.flnGK_01_B.textGridGK_01_B.jpg
LM_02_ALM_02_A.wavLM_02_A.xmlLM_02_A.flkLM_02_A.flnLM_02_A.textGridLM_02_A.jpg
GK_03_BGK_03_B.wavGK_03_B.xmlGK_03_B.flkGK_03_B.flnGK_03_B.textGridGK_03_B.jpg
JA_01_AJA_01_A.wavJA_01_A.xmlJA_01_A.flkJA_01_A.flnJA_01_A.textGridJA_01_A.jpg
GK_01_AGK_01_A.wavGK_01_A.xmlGK_01_A.flkGK_01_A.flnGK_01_A.textGridGK_01_A.jpg
LM_02_BLM_02_B.wavLM_02_B.xmlLM_02_B.flkLM_02_B.flnLM_02_B.textGridLM_02_B.jpg
GK_04_BGK_04_B.wavGK_04_B.xmlGK_04_B.flkGK_04_B.flnGK_04_B.textGridGK_04_B.jpg
OW_01_AOW_01_A.wavOW_01_A.xmlOW_01_A.flkOW_01_A.flnOW_01_A.textGridOW_01_A.jpg
GK_04_AGK_04_A.wavGK_04_A.xmlGK_04_A.flkGK_04_A.flnGK_04_A.textGridGK_04_A.jpg
OW_01_BOW_01_B.wavOW_01_B.xmlOW_01_B.flkOW_01_B.flnOW_01_B.textGridOW_01_B.jpg
LM_01_ALM_01_A.wavLM_01_A.xmlLM_01_A.flkLM_01_A.flnLM_01_A.textGridLM_01_A.jpg
GK_02_BGK_02_B.wavGK_02_B.xmlGK_02_B.flkGK_02_B.flnGK_02_B.textGridGK_02_B.jpg
JR_01_BJR_01_B.wavJR_01_B.xmlJR_01_B.flkJR_01_B.flnJR_01_B.textGridJR_01_B.jpg
LM_01_BLM_01_B.wavLM_01_B.xmlLM_01_B.flkLM_01_B.flnLM_01_B.textGridLM_01_B.jpg
GK_02_AGK_02_A.wavGK_02_A.xmlGK_02_A.flkGK_02_A.flnGK_02_A.textGridGK_02_A.jpg
JR_01_AJR_01_A.wavJR_01_A.xmlJR_01_A.flkJR_01_A.flnJR_01_A.textGridJR_01_A.jpg
Transcript (HTML)Audio file (WAV)XML transcript (TEI)FOLKER transcript (FLK)Normalised FLK-transcript (FLN)Praat (textGrid)Map Task Image


Contributors
The corpus was developed in 2021/2022 by Dolores Lemmenmeier, Josip Batinić, and Anastasia Escher.
We thank all students and colleagues who contributed to the transcription, annotation and development of the corpus: Mirko Božić, Olivier-Andreas Winistörfer, Gabriela Kraljević, Linda Morf, Jovanka Antić, Samra Braković, Jovan Rosić, Dejana Jelena Milićević, Haris Kurtisi, Saša Vidić and Miro Rodin.
Image sources map task A
Market: Gimbal at Serbian Wikipedia, CC BY-SA 3.0 RS , via Wikimedia Commons; bag clip: Rotho; beetroot: kindPNG.com; hair clipper: A&A Hair Beauty; stapler: Ofrex; node: dreamstime.com (Pancaketom); key hole: Thegreenj, CC BY-SA 3.0 , via Wikimedia Commons; bat: PD-USGov, exact author unknown, Public domain, via Wikimedia Commons: ladder, garbage, soup spoon: unknown author.
Image sources map task B
Scissors: makeup.sk; sloth: Roy Toft; bowl: IKEA; cauliflower: shutterstock (Egor Rodynchenko); construction site: factumArchive; bread: Hubertus Schüler; rose hip: RGBStock.com (micromoth); turkey: dreamstime.com (Mike Neale); screwdriver: Narex; paperclip, windshield: unknown author.