Module talk:lo-translit/testcases

From Wiktionary, the free dictionary
Latest comment: 7 years ago by Atitarev in topic ຂໍ້ຄວາມ
Jump to navigation Jump to search

ZWSP[edit]

Should ZWSP be transliterated? Could it be converted to nothing? --Anatoli (обсудить/вклад) 06:32, 13 June 2014 (UTC)Reply

Removed. Wyang (talk) 06:36, 13 June 2014 (UTC)Reply
ZWSP ought to be helpful, but in practice places like the Lao Wikipedia are full of erroneous ZWSP seemingly randomly inserted in the middle of syllables. I guess this must be produced by some flaky tools out there. Ignoring them for now does seem to be the best practical option. — hippietrail (talk) 05:05, 3 September 2014 (UTC)Reply

Template changes needed?[edit]

Have a look at what I did as a test; note the hidden category, if possible. --Lo Ximiendo (talk) 06:52, 13 June 2014 (UTC)Reply

Too early, I think. I'm sure Wyang or someone else will be able to do it with a bot. --Anatoli (обсудить/вклад) 06:58, 13 June 2014 (UTC)Reply

sample text with xlit[edit]

sā thā la na lat pa sā thi pa tai pa sā son lāo (kham khao: sā thān na rat pa sā thip taip sā son lāo[2]) lư̄ sap pa lāo tang yū thit tā wen ʼǭk sīang tai khǭng tha wīp ʼā sī, yū chai kāng khǭng lǣm ʼin dū chīn la wāng sen kha nān thī - ʼong sā nư̄a læ sen wǣng thī ʼong sā sap pa lāo mī nư̄a thī thang mot tā lāng ki lō mæt pen pa thēt thī bǭ mī thāng ʼǭk sū tha lē, mī sāi dǣn tit kap sā thā la na lat pa sā son chīn ki lō mæt), thit tai tit kap lā sa ʼā nā chak kam pū chīa ki lō mæt), thit tā wen ʼǭk tit kap sā thā la na lat sang khom ni nyom wīat nām ki lō mæt thit tā wen tok tit kap lā sa ʼā nā chak thai ki lō mæt læ thit tā wen tok sīang nư̄a tit kap sā thā la na lat ngǣ sa ha phāp mīan mā ki lō mæt sa.pa.pa.lāo pen pa thēt diāu nai phāk phư̄n nī thī bǭ mī sāi dǣn tit kap tha lē. --Anatoli (обсудить/вклад) 07:30, 13 June 2014 (UTC)Reply

ສຍາມ[edit]

I thought Lao doesn't have unwritten vowels before I saw ສຍາມ (sa nyām). This must be an exception - a literal borrowing from Thai. Perhaps a phonetic respelling is required, if this is an exception? --Anatoli (обсудить/вклад) 04:50, 16 June 2014 (UTC)Reply

Anatoli, the communist junta imposed a new orthography when they came into power in the 1970s. Before that the orthography was much more like Thai, with inherent vowels and unpronounced consonant clusters, etc. The new orthography is a lot more phonetic and in places a lot different to Thai and the old Lao orthography. But Laos is a small poor country and there's actually very little writing, which I guess is the reason why it's still common to find a lot of old orthography amongst the writing that is out there. The Lao Wikipedia is full of it. Even some new words defy the new orthography. The word for AIDS has a cluster for the final consonant for instance.
ສຍາມ is old orthography. It's probably still out there because it's the word for Siam, which is itself an obsolete word. The new orthography word is the word for Thailand rather than Siam. I believe there is a new orthography spelling for Siam too, but my reading of SEAlang suggests that it overlaps with with another word or meaning so I haven't tried to add it to Wiktionary. One odd thing I have found is a spelling for Thailand which uses a final -y in combination with a left ai- vowel. This is usually an impossible combination in Lao orthography but seems to be a direct Thai to Lao transliteration. — hippietrail (talk) 05:14, 3 September 2014 (UTC)Reply

ຂໍ້ຄວາມ[edit]

@Wyang, Octahedron80 Hi. Is it even possible to determine syllable boundaries for Lao ຂໍ້ຄວາມ (khǭk wām), which is = Thai ข้อความ (kɔ̂ɔ-kwaam)? --Anatoli T. (обсудить/вклад) 03:12, 17 September 2016 (UTC)Reply

I think ຂໍ້ຄ may be an invalid syllable in Lao, but I'm sure Octahedron80 would know more. Wyang (talk) 11:22, 17 September 2016 (UTC)Reply
@Wyang, Octahedron80 Thanks Frank, sorry, I missed the reply. In ເອຣີເຕຣ (ʼē rī tēn, Eritrea), the syllabification should be "ʼē-rī-tēn" or manual override should be enabled. Perhaps Lao, like Thai, needs syllable breaks "-". --Anatoli T. (обсудить/вклад) 08:50, 16 October 2016 (UTC)Reply