Archive - TechTalks

Beyond chain-of-thought: A look at the Hierarchical Reasoning Model

With 27M parameters and 1,000 examples, HRM beats top LLMs on key reasoning benchmarks

19 hrs ago •

What we know so far about Gemini 2.5 Deep Think

A look inside Google’s Gemini 2.5 Deep Think, the AI that uses extended "slow thinking" to solve complex math and code problems.

Aug 4 •

July 2025

LegalPwn: The prompt injection attack hiding in the fine print of your code

LegalPwn, a new prompt injection attack, uses fake legal disclaimers to trick major LLMs into approving and executing malicious code.

Jul 31 •

MIT introduces new RL technique that moves beyond binary rewards

AI models are often overconfident. A new MIT training method teaches them self-doubt, improving reliability and making them more trustworthy.

Jul 29 •

How Windsurf came back from the brink

Back from the brink, Windsurf is "friends again" with Anthropic, integrating Claude Sonnet 4 and Devin in a bold new survival strategy.

Jul 24 •

New research reveals critical flaw in LLM-as-a-judge methods

Researchers discover critical vulnerability in LLM-as-a-judge reward models that could compromise the integrity and reliability of your AI training…

Jul 22 •

A first look at the new ChatGPT agent (and how it can change the internet)

OpenAI's powerful new ChatGPT Agent redefines AI capabilities while introducing new risk and attack vectors in security and data integrity of AI…

Jul 19 •

Inside the semantic attack that fools Grok-4 (and other LLMs)

Researchers jailbroke Grok-4 using a combined attack. The method manipulates conversational context, revealing a new class of semantic vulnerabilities.

Jul 17 •

A reality check on the 'emergent abilities' of LLMs

A new paper argues that "emergent abilities" in LLMs aren't true intelligence. The difference is crucial and has implications for real-world…

Jul 15 •

How Windsurf became a casualty of the AI arms race

Caught between tech giants, AI startup Windsurf became the main casualty when Google poached its leaders, gutting OpenAI's $3B acquisition deal.

Jul 13 •

Tackling the challenges of AI in space

An AI engineer explains how a new breed of tiny, hyper-efficient ML models is revolutionizing space, extending satellite life and unlocking the next…

Jul 8 •

Does aligning LLMs with human cognition come at the cost of less powerful models?

To make AI more human-like, must we sacrifice its power? A new study shows why LLM efficiency creates a gap in understanding.

Jul 1 •

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts