This benchmark tests how LLMs handle semantic and emotional exceptions in Brazilian Portuguese expressions.
The questions were written by Matteo Sisti.
- Total Prompts: 92
- Scored Responses: 1055
- Contributors: 4
- Average Overall Score: 32%
| Rank | Model | Avg. Score | Prompts Tested | Avg. Response Time |
|---|---|---|---|---|
| 1 | x-ai/grok-4.1-fast | 0.44 | 92 | 14ms |
| 2 | x-ai/grok-4-fast | 0.37 | 92 | 6ms |
| 3 | google/gemini-2.5-flash-lite | 0.33 | 92 | 2ms |
| 4 | meta-llama/llama-4-maverick | 0.27 | 92 | 3ms |
| 5 | meta-llama/llama-3.3-70b-instruct:free | 0.14 | 88 | 3ms |