obvithrowaway34434

2025-03-23 15:40:05

Remember how the whole of Reddit and other social media were so convinced OpenAI was done for in January?

obvithrowaway34434

2025-03-23 13:18:38

A new study finds that individuals randomly assigned to use AI did as well as teams of two people and were happier as well

lessis_amess

2025-03-22 11:44:18

OpenAI released GPT-4.5 and O1 Pro via their API and it looks like a weird decision.

namanyayg

2025-03-21 03:55:28

Vibe Coding is a Dangerous Fantasy

PestoPastaLover

2025-03-21 01:09:37

Why no mid-tier? I feel like OpenAI is missing a huge potential here.

obvithrowaway34434

2025-03-20 06:03:37

New study from METR suggests the length of tasks AI models can handle is doubling every 7 months, suggesting automating week or month long tasks is less than 5 years away

obvithrowaway34434

2025-03-20 06:08:18

New study from METR suggests the length of tasks AI models can handle is doubling every 7 months, suggesting automating week- or month-long tasks is less than 5 years away

Alex__007

2025-03-19 08:37:40

According to Bloomberg, OpenAI Operator can't even book a simple flight, and agents as a whole are really struggling to deliver any value...

obvithrowaway34434

2025-03-15 04:24:47

DeepSeek's owner asked R&D staff to hand in passports so they can't travel abroad. How does this make any sense considering DeepSeek open-sources everything?

obvithrowaway34434

2025-03-10 01:18:27

Manus turns out to be just Claude Sonnet + 29 other tools, Reflection 70B vibes ngl

obvithrowaway34434

2025-03-10 00:40:30

Manus turns out to be just Claude Sonnet + 29 other tools

obvithrowaway34434

2025-03-10 00:27:57

So the much-hyped Manus AI agent from China turns out to be just Claude Sonnet + 29 other tools

Growsomedope

2025-03-08 05:42:59

Jokic, who was questionable tonight with an ankle injury, becomes the first player with 30 pts, 20 reb, 20 ast in a game in NBA history

Healthy-Nebula-3603

2025-03-07 20:48:08

QwQ on LiveBench - is better than Sonnet 3.7 (non thinking)!

LoretiTV

2025-03-07 02:35:29

Severance - 2x08 "Sweet Vitriol" - Post-Episode Discussion

Longjumping_War4808

2025-03-07 02:52:42

What's the point of local LLM for coding?

obvithrowaway34434

2025-03-05 10:09:49

o1-like image generator next? This could be game-changing if it works!

obvithrowaway34434

2025-03-05 09:46:47

OpenAI's next image generation will likely have some kind of chain of thought/inference time compute usage, probably based on GPT-4o. This could be very interesting.

obvithrowaway34434

2025-03-02 08:49:39

GPT-4.5 creates a Louis CK style standup routine. This material is new afaik and genuinely funny. I haven't seen any model generate anything remotely close to this

miladkhademinori

2025-03-01 21:42:27

How is Sesame not all everyone is talking about today? This blows ChatGPT Voice out of the water. I am in awe!

Longjumping-Stay7151

2025-02-28 06:40:21

GPT-4.5 is a base model. Just compare other thinking models to their non-thinking versions to see what's coming.

GOD-SLAYER-69420Z

2025-02-24 20:32:02

The big week has started with an absolute banger!!!!! Claude 3.7 sonnet absolutely crushes every single competitor in real world coding tasks by a large margin

Ill_Shirt_6013

2025-02-24 20:42:38

Claude 3.7 results in the Aider Polyglot benchmark

Mr_Hyper_Focus

2025-02-24 22:03:46

3.7 sonnet LiveBench results are in

stealthispost

2025-02-24 02:50:47

Everyone is catching up.

popjoe123

2025-02-23 23:36:46

Everyone is catching up.
