← All comparisons

Trinity Large Thinking vs Qwen3 VL 4B (Reasoning)

Arcee AI vs Alibaba — side-by-side benchmark comparison

Trinity Large ThinkingQwen3 VL 4B (Reasoning)
Intelligence Index31.913.7
Coding Index27.26.7
Math Index25.7
Output speed (tok/s)171.40.0
Blended price ($/1M)$0.40$0.00
Time to first token (s)0.67s0.00s
aime
aime 2525.7%
artificial analysis coding index27.206.70
artificial analysis intelligence index31.9013.70
artificial analysis math index25.70
gpqa75.2%49.4%
hle14.7%4.4%
ifbench56.3%36.6%
lcr33.0%21.3%
livecodebench32.0%
math 500
mmlu pro70.0%
scicode36.1%17.1%
tau290.1%15.5%
terminalbench hard22.7%1.5%

Benchmark data from Artificial Analysis.