Adjusting Tokenizer Training for Domain-specific Large Language Models – The Case of Serbian Legal Domain
XVII International Conference on Systems, Automatic Control and Measurements, SAUM 2024 (pp. 71-75) АУТОР(И) / AUTHOR(S): Jelena Kocić , Miloš Bogdanović , Milena Frtunić Gligorijević , Leonid Stoimenov Download Full Pdf DOI: 10.46793 САЖЕТАК / ABSTRACT: The advancement of large-scale language…