This workshop aims to bring together people working on Universal Dependencies (UD) treebanks for Turkic languages. The workshop will be held in conjunction with the UniDive 2nd WG3 meeting in Istanbul on 8 September 2023. We also aim to facilitate online participation.
The focus of the workshop will be consistent annotation across different treebanks. We are also interested in linguistic phenomena in Turkic languages that are difficult (or inelegant) to annotate with the current UD guidelines.
Other topics, such as new treebanks for Turkic languages, (linguistic) research done on the treebanks, and other uses of treebanks for research and practical applications, are also welcome.
The workshop is supported by the UniDive COST action project.
For the afternoon discussions, we have prepared a list of Turkish sentences based on the current treebanks and their issues. The sentences, which demonstrate various issues raised by the participants, can be found here with glosses and English translations. During the workshop, we intend to work on translations of these sentences, or on similar sentences demonstrating the same issues, in other Turkic languages. You can find the CoNLL-U file of the Turkish sentences here. For reference, they are also available on this page in plain text. If possible, we will also prepare a list of sentences for other Turkic languages, based on translations and the particular issues of those languages. We will discuss the issues in the sentences and try to come up with solutions. Other sentences and translations are very much welcome.
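For participants who want to inspect the sentences programmatically, the following is a minimal sketch of reading a CoNLL-U file with the Python standard library only. It relies on the standard ten tab-separated CoNLL-U columns, `#` comment lines, and blank lines between sentences. The sample sentence, its annotation, and the function names `read_conllu` and `deprel_counts` are made up for illustration; they are not taken from the workshop's file.

```python
from collections import Counter

# Hypothetical sample for illustration only; not from the workshop's file.
SAMPLE = (
    "# sent_id = example-1\n"
    "# text = Ben geldim.\n"
    "1\tBen\tben\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tgeldim\tgel\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
)

# The ten columns defined by the CoNLL-U format, in order.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


def read_conllu(text):
    """Return a list of sentences, each a list of token dicts."""
    sentences, tokens = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if tokens:
                sentences.append(tokens)
                tokens = []
        elif line.startswith("#"):
            # Sentence-level comments (sent_id, text, ...) are skipped here.
            continue
        else:
            tokens.append(dict(zip(FIELDS, line.split("\t"))))
    if tokens:
        # Handle a file that does not end with a blank line.
        sentences.append(tokens)
    return sentences


def deprel_counts(sentences):
    """Count dependency relations over ordinary word lines only.

    Multiword-token ranges (IDs like "1-2") and empty nodes (IDs like
    "1.1") are skipped because their IDs are not plain integers.
    """
    return Counter(
        tok["deprel"]
        for sent in sentences
        for tok in sent
        if tok["id"].isdigit()
    )


if __name__ == "__main__":
    sents = read_conllu(SAMPLE)
    for rel, count in sorted(deprel_counts(sents).items()):
        print(rel, count)
```

Comparing such relation counts across two treebanks is one quick, rough way to spot places where annotation conventions may differ. To use the sketch on a real file, read the file's contents as UTF-8 text and pass them to `read_conllu`.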
Call for abstracts
We invite abstracts for short presentations (15 minutes + 5 minutes of Q&A) on topics concerning UD Turkic treebanks. The maintainers of existing treebanks and those who are preparing new treebanks are strongly encouraged to introduce their treebanks and discuss interesting or difficult cases of annotation.
If you are interested in presenting your treebank in this workshop, please submit a short abstract (100 to 200 words) of your talk through easychair.org/conferences/?conf=udturkic2023 before August 28, 2023 (23:59 UTC). The abstract should include brief information about the treebank that you are maintaining or building, as well as the issues you would like to discuss (briefly) in your presentation. Abstracts on other topics concerning the UD Turkic treebanks are also welcome. The abstracts will receive light review from the organizers, and are mainly intended for planning and informational purposes.
Program
The workshop will be in two parts. In the morning session, we will have short presentations, followed by a discussion/panel session in the afternoon.
Times are in UTC+3, Istanbul time.
Friday, 8 September
- 08:45-09:00 - Opening remarks - Büşra Marşan, Çağrı Çöltekin
- 09:00-10:30 - Presentations
- 09:00-09:20 - Turkish GB treebank - Çağrı Çöltekin [abstract, presentation]
- 09:20-09:40 - Parallel Perspectives: ATIS Dependency Treebanks in English and Turkish - Aslı Kuzgun, Olcay Taner Yıldız, Mehmet Köse and Neslihan Cesur [abstract, presentation]
- 09:40-10:00 - Journey of TreeBanking - Olcay Taner Yıldız, Aslı Kuzgun, Neslihan Cesur, Bilge Nas Arıcan, Büşra Marşan, Neslihan Kara, Oğuzhan Kuyrukçu, Ezgi Sanıyar, Merve Özçelik, Arife Betül Yenice and Deniz Baran Aslan [abstract, presentation]
- 10:00-10:20 - BOUN Treebank v2.11 - Büşra Marşan, Furkan Akkurt, Tunga Güngör, Balkız Öztürk and Suzan Üsküdarlı [abstract, presentation]
- 10:30-11:00 - Coffee break
- 11:00-12:30 - Presentations and Introduction to the afternoon session
- 11:00-11:20 - Contextualizing the Present and Future of Old Turkish Annotation within Universal Dependencies for Turkic Languages - Mehmet Oguz Derin [abstract, presentation]
- 11:20-11:40 - Annotation issues in UD-Turkish - Büşra Marşan, Çağrı Çöltekin [presentation]
- 11:40-12:00 - Introduction to an annotation tool for UD treebanks (BoAT) - Büşra Marşan, Furkan Akkurt
- 12:00-12:30 - How to proceed in discussions
- 12:30-14:00 - Lunch break
- 14:00-15:30 - Discussion and Presentations
- 14:00-14:20 - UD-Tatar NMCTT Treebank: Issues in annotation across Turkic - Chihiro Taguchi [abstract, presentation]
- 14:20-14:40 - Towards a UD treebank for Kyrgyz - Aida Kasieva, Gulnura Dzhumalieva, Anna Thompson and Jonathan Washington [abstract, presentation]
- 14:40-15:30 - Discussion on annotation issues
- 15:30-16:00 - Coffee break
- 16:00-17:30 - Discussion and Wrap-up
- 16:00-17:15 - Discussion on annotation issues
- 17:15-17:30 - Wrap-up and future plans
Organizers
- Çağrı Çöltekin, University of Tübingen
- Büşra Marşan, Boğaziçi University
- Suzan Üsküdarlı, Boğaziçi University
- Tunga Güngör, Boğaziçi University
- Furkan Akkurt, Boğaziçi University
Contact point for questions / help: udtw-organisers@googlegroups.com