Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
thatcherthorn
on Dec 6, 2023
|
parent
|
context
|
favorite
| on:
Gemini AI
They've reported surpassing GPT4 on several benchmarks. Does anyone know of these are hand picked examples or is this the new SOTA?
xiphias2
on Dec 6, 2023
|
next
[–]
It will be SOTA maybe when Gemini Ultra is available. GPT-4 is still SOTA.
philomath_mn
on Dec 6, 2023
|
parent
|
next
[–]
Usually SOTA status is established when the benchmark paper is released (probably after some review). But GPT4 is the current generally-available-SOTA
silveraxe93
on Dec 6, 2023
|
parent
|
prev
|
next
[–]
They also compare to RLHFed GPT-4, which reduces capabilities, while their model seems to be pre-RLHF. So I'd expect those numbers to be a bit inflated compared to public release.
williamstein
on Dec 6, 2023
|
prev
[–]
They certainly claim it is SOTA for multimodal tasks: “Gemini surpasses SOTA performance on all multimodal tasks.”
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: