Наредни састанак Семинара биће одржан онлајн у среду, 19. априла 2023, са почетком у 19 часова.

Предавач: Joao Paulo Carvalho, INESC-ID/Instituto Superior Técnico, Universidade de Lisboa, Portugal

Наслов предавања: FUZZY FINGERPRINTS: FROM USER IDENTIFICATION TO LARGE LANGUAGE MODELS

Апстракт: Fuzzy Fingerprints (FFP) are a frequency-based compact classification technique inspired by human fingerprints. They usually perform and compare well against other machine learning techniques when the number of classes is very large. FFPs have been used and adapted for tasks such as mobile phone user identification, web user identification, text authorship attribution, Tweet Topic Detection, prediction of Intensive Care Unit readmissions from medical text notes, Memory-based Collaborative filtering solutions in the Recommendation Systems domain, or cyberbullying detection in social networks. FFPs have been successfully used as an interpretable text classification technique, but, like most other techniques, have lately been largely surpassed in performance by Large Pre-trained Language Models (LLM), such as BERT or RoBERTa. However, these LLM suffer from the lack of interpretability and explainability. Recently we were able to combine the interpretability and compact characteristics of the FFP framework with the robustness of the large pre-trained models and shown that, even with a small FFP size, this new architecture can generalize and compete with the results from fine-tuned LLM models.

Напомена: Регистрациона форма за учешће на Семинару је доступна на линку:
https://miteam.mi.sanu.ac.rs/asset/CW5nJWDSEZDj7p32p

Уколико желите само да пратите предавање без могућности активног учешћа, пренос је доступан на линку:
https://miteam.mi.sanu.ac.rs/asset/4LNW8WtML7rLKojoz