The latest reference corpus of written Slovene, the Gigafida corpus, was created as part of the ‘Communication in Slovene’ project. In the same project, a web concordancer was designed for the broadest possible use and tailored to the needs and abilities of user groups such as translators, writers, proofreaders and teachers. Two years after the corpus was published within the new tool, its features were assessed by the users. With an average rating of 4.36 on a scale from 1 to 5 (1 = I strongly disagree, 5 = I strongly agree), the results indicate that most survey participants agreed or strongly agreed with positive statements about the new implementations (e.g. “The corpus results are displayed in a clear manner”). This is a considerable improvement in user experience over the previous reference corpus of Slovene, i.e. the FidaPLUS corpus within the ASP32 concordancer (rated 3.67). In the user feedback, the simplicity of the search options and the clarity of the interface are highlighted as the main advantages, while advanced visualizations of corpus data and improved searching of word phrases are suggested for future development. The evaluation also highlighted some relevant user habits, such as not taking the time to learn systematically about the tool before starting to use it. The findings will be implemented in future editions of the Gigafida corpus, but are relevant to any project that aims at facilitating a wider use of reference corpora and corpus-based resources.

The incorporation of grammatical information into speech recognition systems is often used to increase performance in morphologically rich languages. However, this introduces demands for sufficiently large training corpora and proper methods of using the additional information. In this paper, we present a method for building factored language models that use data obtained by morphosyntactic tagging. The models use only relevant factors that help to increase performance and ignore data from other factors, thus also reducing the need for large morphosyntactically tagged training corpora.