TechTalks
Subscribe
Sign in
Home
Archive
About
Latest
Top
Discussions
Beyond chain-of-thought: A look at the Hierarchical Reasoning Model
With 27M parameters and 1,000 examples, HRM beats top LLMs on key reasoning benchmarks
19 hrs ago
•
Ben Dickson
6
Share this post
TechTalks
Beyond chain-of-thought: A look at the Hierarchical Reasoning Model
Copy link
Facebook
Email
Notes
More
What we know so far about Gemini 2.5 Deep Think
A look inside Google’s Gemini 2.5 Deep Think, the AI that uses extended "slow thinking" to solve complex math and code problems.
Aug 4
•
Ben Dickson
4
Share this post
TechTalks
What we know so far about Gemini 2.5 Deep Think
Copy link
Facebook
Email
Notes
More
July 2025
LegalPwn: The prompt injection attack hiding in the fine print of your code
LegalPwn, a new prompt injection attack, uses fake legal disclaimers to trick major LLMs into approving and executing malicious code.
Jul 31
•
Ben Dickson
7
Share this post
TechTalks
LegalPwn: The prompt injection attack hiding in the fine print of your code
Copy link
Facebook
Email
Notes
More
1
MIT introduces new RL technique that moves beyond binary rewards
AI models are often overconfident. A new MIT training method teaches them self-doubt, improving reliability and making them more trustworthy.
Jul 29
•
Ben Dickson
4
Share this post
TechTalks
MIT introduces new RL technique that moves beyond binary rewards
Copy link
Facebook
Email
Notes
More
1
How Windsurf came back from the brink
Back from the brink, Windsurf is "friends again" with Anthropic, integrating Claude Sonnet 4 and Devin in a bold new survival strategy.
Jul 24
•
Ben Dickson
3
Share this post
TechTalks
How Windsurf came back from the brink
Copy link
Facebook
Email
Notes
More
New research reveals critical flaw in LLM-as-a-judge methods
Researchers discover critical vulnerability in LLM-as-a-judge reward models that could compromise the integrity and reliability of your AI training…
Jul 22
•
Ben Dickson
10
Share this post
TechTalks
New research reveals critical flaw in LLM-as-a-judge methods
Copy link
Facebook
Email
Notes
More
A first look at the new ChatGPT agent (and how it can change the internet)
OpenAI's powerful new ChatGPT Agent redefines AI capabilities while introducing new risk and attack vectors in security and data integrity of AI…
Jul 19
•
Ben Dickson
7
Share this post
TechTalks
A first look at the new ChatGPT agent (and how it can change the internet)
Copy link
Facebook
Email
Notes
More
2
Inside the semantic attack that fools Grok-4 (and other LLMs)
Researchers jailbroke Grok-4 using a combined attack. The method manipulates conversational context, revealing a new class of semantic vulnerabilities.
Jul 17
•
Ben Dickson
6
Share this post
TechTalks
Inside the semantic attack that fools Grok-4 (and other LLMs)
Copy link
Facebook
Email
Notes
More
1
A reality check on the 'emergent abilities' of LLMs
A new paper argues that "emergent abilities" in LLMs aren't true intelligence. The difference is crucial and has implications for real-world…
Jul 15
•
Ben Dickson
11
Share this post
TechTalks
A reality check on the 'emergent abilities' of LLMs
Copy link
Facebook
Email
Notes
More
2
How Windsurf became a casualty of the AI arms race
Caught between tech giants, AI startup Windsurf became the main casualty when Google poached its leaders, gutting OpenAI's $3B acquisition deal.
Jul 13
•
Ben Dickson
5
Share this post
TechTalks
How Windsurf became a casualty of the AI arms race
Copy link
Facebook
Email
Notes
More
1
Tackling the challenges of AI in space
An AI engineer explains how a new breed of tiny, hyper-efficient ML models is revolutionizing space, extending satellite life and unlocking the next…
Jul 8
•
Ben Dickson
18
Share this post
TechTalks
Tackling the challenges of AI in space
Copy link
Facebook
Email
Notes
More
Does aligning LLMs with human cognition come at the cost of less powerful models?
To make AI more human-like, must we sacrifice its power? A new study shows why LLM efficiency creates a gap in understanding.
Jul 1
•
Ben Dickson
6
Share this post
TechTalks
Does aligning LLMs with human cognition come at the cost of less powerful models?
Copy link
Facebook
Email
Notes
More
1
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts