cardiffnlp/super_tweeteval
Viewer • Updated • 255k • 2.91k • 15
A new multilingual dataset for tweet intimacy analysis, MINT, is benchmarked against popular multilingual pre-trained language models.
We propose MINT, a new Multilingual INTimacy analysis dataset covering 13,372 tweets in 10 languages including English, French, Spanish, Italian, Portuguese, Korean, Dutch, Chinese, Hindi, and Arabic. We benchmarked a list of popular multilingual pre-trained language models. The dataset is released along with the SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis (https://sites.google.com/umich.edu/semeval-2023-tweet-intimacy).
Get this paper in your agent:
hf papers read 2210.01108 curl -LsSf https://hf.co/cli/install.sh | bash No model linking this paper
No Space linking this paper
No Collection including this paper