Wiktionary talk:Frequency lists/Czech

From Wiktionary, the free dictionary
Latest comment: 13 years ago by Dan Polansky in topic Deletion
Jump to navigation Jump to search

Deletion[edit]

The frequency lists from Czech national corpus are not released under a suitable license. The lists are available from http://ucnk.ff.cuni.cz/srovnani10.php. I have contacted Michal Křen, a person who took part on publishing the lists, and he has answered that the license of the material has not yet been determined by UCNK (Ústav Českého národního korpusu, Institute for Czech national corpus). Thus, there is no license that makes it possible to place this material into Wiktionary, in which all material has to be released under CC-BY-SA. All material published on the Internet is automatically copyrighted and can be republished only if an explicit license allows to do to. --Dan Polansky 12:45, 18 May 2011 (UTC)Reply

Though, it doesn't seem to be identical to the page you're linking to, why is that? Mglovesfun (talk) 17:51, 18 May 2011 (UTC)Reply
I note that this page was created by WF and that the material continues to be available at cs:Wikislovník:Frekvenční_seznamy/ČNK_SYN2005/1-1000 (not our business). The licensing issue sounds very plausible so there should be no need to discuss this any further here. -- Gauss 23:08, 18 May 2011 (UTC)Reply
@MG, if you go to the section "Download" of http://ucnk.ff.cuni.cz/srovnani10.php, download syn2005_lemma.zip from there, and unzip the file, you get a tab-separated text file that contains a frequency list with various frequencies. This tab-separated text file was used for the creation of the wiki pages. The wiki pages contained less information than the tab-separated text file, as they did not state any frequencies; the information they did contain was the particular order of words identical to the order of words in the tab-separated text file. --Dan Polansky 08:56, 19 May 2011 (UTC)Reply
@Gauss: I have notified Czech Wiktionary of these lists, in an analogue of Beer parlour called "Pod lípou": cs:Wikislovník:Pod_lípou#Frekvenční seznamy a jejich licence, 18. 5. 2011. --Dan Polansky 09:00, 19 May 2011 (UTC)Reply