Data and source code accompanying the talk:
У. В. Парыцкі. Украінскі 14-складовы верш у беларускай паэзіі: рытміка перакладаў і імітацый // X Міжнародны Кангрэс даследчыкаў Беларусі, Коўна, 01.10.2022 [Vladislav Poritski. Ukrainian 14-syllable verse in Belarusian poetry: the rhythm of translations and imitations // Presented at 10th International Congress of Belarusian Studies, Kaunas, 01.10.2022] The empirical investigation of 14-syllable verse, presented in the talk, is based upon a sample of Ukrainian texts by Taras Shevchenko, their Belarusian translations, and original Belarusian poetry by Yanka Kupala, Yakub Kolas, Piatruś Brouka. The dataset structure is as follows: ./0_plain – plain texts; ./1_accentuated – accentuated texts; metadata_shevchenko.tsv, metadata_be_authors.tsv – metadata files describing the texts; make_reports.py – a Python script to generate statistic reports from the accentuated texts; ./2_reports – programmatically generated reports; slides.tex – LaTeX source code of the talk's slides, where the reports are embedded as diagrams and tables; slides.pdf – PDF version of the slides. The directories ./0_plain, ./1_accentuated, ./2_reports are provided in .zip archives. Belarusian translations of Taras Shevchenko's poetry have been taken from the book:
Т. Р. Шаўчэнка. Вершы. Паэмы. Мінск: Мастацкая літаратура, 1989.
(scan copy available at https://files.knihi.com/Knihi/scanned/Saucenka.Viersy_paemy.djvu)
Each poem is stored in a separate .txt file. The file name indicates the number of the poem's first page in the scanned book, e.g.: 021.txt. Same names are used for the respective Ukrainian texts. In each pair of files, such as e.g. ./0_plain/uk/021.txt and ./0_plain/be/021.txt, the texts are aligned line by line. Poem titles in both languages, translator names, and the URLs of Ukrainian source texts are provided in metadata_shevchenko.tsv. Original Belarusian poetry, kept in ./0_plain/be, doesn't require any alignment, and the naming scheme is different. Poem titles, author names, and the URLs of Belarusian source texts are provided in metadata_be_authors.tsv. In all Ukrainian and Belarusian texts, metrically irrelevant lines are discarded, only 14-syllable verse lines are stored, each of them split graphically into 8+6 syllables. Occasional minor violations, i.e. ± one or two syllables, are allowed in the texts but ignored in the statistic reports. No spans shorter than a pair of rhyming 14-syllable lines (or, graph...
У. В. Парыцкі. Украінскі 14-складовы верш у беларускай паэзіі: рытміка перакладаў і імітацый // X Міжнародны Кангрэс даследчыкаў Беларусі, Коўна, 01.10.2022 [Vladislav Poritski. Ukrainian 14-syllable verse in Belarusian poetry: the rhythm of translations and imitations // Presented at 10th International Congress of Belarusian Studies, Kaunas, 01.10.2022] The empirical investigation of 14-syllable verse, presented in the talk, is based upon a sample of Ukrainian texts by Taras Shevchenko, their Belarusian translations, and original Belarusian poetry by Yanka Kupala, Yakub Kolas, Piatruś Brouka. The dataset structure is as follows: ./0_plain – plain texts; ./1_accentuated – accentuated texts; metadata_shevchenko.tsv, metadata_be_authors.tsv – metadata files describing the texts; make_reports.py – a Python script to generate statistic reports from the accentuated texts; ./2_reports – programmatically generated reports; slides.tex – LaTeX source code of the talk's slides, where the reports are embedded as diagrams and tables; slides.pdf – PDF version of the slides. The directories ./0_plain, ./1_accentuated, ./2_reports are provided in .zip archives. Belarusian translations of Taras Shevchenko's poetry have been taken from the book:
Т. Р. Шаўчэнка. Вершы. Паэмы. Мінск: Мастацкая літаратура, 1989.
(scan copy available at https://files.knihi.com/Knihi/scanned/Saucenka.Viersy_paemy.djvu)
Each poem is stored in a separate .txt file. The file name indicates the number of the poem's first page in the scanned book, e.g.: 021.txt. Same names are used for the respective Ukrainian texts. In each pair of files, such as e.g. ./0_plain/uk/021.txt and ./0_plain/be/021.txt, the texts are aligned line by line. Poem titles in both languages, translator names, and the URLs of Ukrainian source texts are provided in metadata_shevchenko.tsv. Original Belarusian poetry, kept in ./0_plain/be, doesn't require any alignment, and the naming scheme is different. Poem titles, author names, and the URLs of Belarusian source texts are provided in metadata_be_authors.tsv. In all Ukrainian and Belarusian texts, metrically irrelevant lines are discarded, only 14-syllable verse lines are stored, each of them split graphically into 8+6 syllables. Occasional minor violations, i.e. ± one or two syllables, are allowed in the texts but ignored in the statistic reports. No spans shorter than a pair of rhyming 14-syllable lines (or, graph...