AI Testing

OverviewList HNewsList Category HNewsAI Testing - Hacker News - Why SWE-bench Verified no longer measures frontier coding capabilities

Add Comment

notice: please create a custom view template for the hackernewscore class view-hackernewscore.html

Why SWE-bench Verified no longer measures frontier coding capabilities

🚨 Frontier Coding Capabilities Put to the Test: The SWE-bench Verified Update

8:05 pm, April 26, 2026

guid

https://news.ycombinator.com/item?id=47910388

source_url

https://openai.com/index/why-we-no-longer-evaluate-swe-bench-verified/

author_name

kmdupree

id: 2212
uid: FnUuJ
insdate: 2026-04-26 20:05:39
title: Why SWE-bench Verified no longer measures frontier coding capabilities
additional:

🚨 Frontier Coding Capabilities Put to the Test: The SWE-bench Verified Update

OpenAI has announced that SWE-bench Verified will no longer measure frontier coding capabilities, citing limitations in accurately assessing advanced coding skills. This decision comes as the field of AI-powered coding continues to evolve rapidly. By moving away from SWE-bench Verified, OpenAI aims to refine its evaluation methods.
category: Hacker News
md5:
guid: https://news.ycombinator.com/item?id=47910388
source_url: https://openai.com/index/why-we-no-longer-evaluate-swe-bench-verified/
updated:
image:
author_name: kmdupree
author_link:

Add Comment

Nick Name Type in a Nick Name here

Comment

Autonomous AI API, a cutting-edge platform that leverages advanced AI technologies to enable self-modification and self-repair of its core files. This innovative site utilizes machine learning algorithms to detect and correct errors, ensuring maximum uptime and performance. With its autonomous capabilities, the AI API can adapt to changing requirements, learn from user interactions, and continuously improve its functionality.

View Details

cybersec Overview List Category cybersec List cybersec List Table cybersec Search cybersec

Images Overview List Category Images List Images List Table Images Search Images

Videos Overview List Category Videos List Videos List Table Videos Search Videos

Wiki Overview List Category Wiki List Wiki List Table Wiki Search Wiki

Page Views

This page has been viewed 4 times.

Search HNews

Search HNews by entering your search text above.

Category List HNews

"Cancel ChatGPT" movement goes mainstream after OpenAI closes deal with U.S. Dow
"Collaboration" Is Bullshit
"Disregard That" Attacks
"People who don't use AI will be left behind"
"Plain text has been around for decades and it's here to stay." – Unsung
"Special 301" Comments on Nintendo Game Piracy in Asia and Latin America (1994)
"That Shape Had None" – A Horror of Substrate Independence (Short Fiction)
"The new Copilot app for Windows 11 is really just Microsoft Edge"
"Warn about PyPy being unmaintained"
"We do not think Anthropic should be designated as a supply chain risk"
$500M for Virtual Biology Initiative, Funded by Zuckerbergs
$96 3D-printed rocket that recalculates its mid-air trajectory using a $5 sensor
'AI washing': firms are scrambling to rebrand themselves as tech-focused
'Fatal decision': EU slammed for caving to US pressure on digital rules
'Hairdryer used to trick weather sensor' to win Polymarket bet
'Kitten Space Agency', the Spiritual Successor to 'Kerbal Space Program' (2025)
'Miracle': Europe reconnects with lost spacecraft
'No Way to Prevent This,' Says Only Package Manager Where This Regularly Happens
'Point of no return': New Orleans relocation must start now due to sea level
'The Secret Agent': Exploring a Vibrant, yet Violent Brazil (2025)
'We mould trees to grow into the shape of chairs'
'Your Frustration Is the Product'
(Blender) Cosmology with Geometry Nodes
.de TLD offline due to DNSSEC?
.de domains were 'down' for 2 hours
/e/OS is a complete "deGoogled", mobile ecosystem
10-202: Introduction to Modern AI (CMU)
100M-Row Challenge with PHP
15 years, one server, 8GB RAM and 500k users – how Webminal refuses to die
1B identity records exposed in ID verification data leak
1M context is now generally available for Opus 4.6 and Sonnet 4.6
2% of ICML papers desk rejected because the authors used LLM in their reviews
2,100 Swiss municipalities showing which provider handles their official email
2-D Mathematical Curves
20 years on AWS and never not my job
2009 Aftonbladet Israel Controversy
245TB Micron 6600 ION Data Center SSD Now Shipping
3.4M Solar Panels
3D-Knitting: The Ultimate Guide
4-bit floating point FP4
404 Deno CEO not found
447 TB/cm² at zero retention energy – atomic-scale memory on fluorographane
560-610 minutes of exercise a week needed for substantial heart benefits
7 lines of code, 3 minutes: Implement a programming language (2010)
80386 Microcode Disassembled
8087 Emulation on 8086 Systems
81yo Dodgers fan can no longer get tickets because he doesn't have a smartphone
9 Mothers (YC P26) Is Hiring
9 Mothers (YC P26) Is Hiring – Lead Robotics and More
90% of Claude-linked output going to GitHub repos w <2 stars
90% of crypto's Illinois primary spending failed to achieve its objective
A Better R Programming Experience Thanks to Tree-sitter
A CPU that runs entirely on GPU
A Canonical Generalization of OBDD
A Claude Code and Codex Skill for Deliberate Skill Development
A Compiler Writing Journey
A Couple Million Lines of Haskell: Production Engineering at Mercury
A Decade of Docker Containers
A Decade of Slug
A Eureka machine that thinks like nature and explores what AI cannot
A Faster Alternative to Jq
A Few Good Magazines From the 70s and 80s
A GitHub Issue Title Compromised 4k Developer Machines
A HN post with negative points – how?
A Japanese Glossary of Chopsticks Faux Pas
A Japanese glossary of chopsticks faux pas (2022)
A Letter from Dijkstra on APL
A Look into NaviDial, Japan's Legacy Phone Service
A Mysterious Numbers Station Is Broadcasting Through the Iran War
A Nationwide Book Ban Bill Has Been Introduced in the House of Representatives
A Perfectable Programming Language
A Rave Review of Superpowers (For Claude Code)
A Report on Burnout in Open Source Software Communities (2025) [pdf]
A Roblox cheat and one AI tool brought down Vercel's platform
A Visual Introduction to Machine Learning
A WebGPU Implementation of Augmented Vertex Block Descent
A bit of fluid mechanics from scratch not from scratch
A blueprint for formal verification of Apple corecrypto
A cache-friendly IPv6 LPM with AVX-512 (linearized B+-tree, real BGP benchmarks)
A case for Go as the best language for AI agents
A compelling title that is cryptic enough to get you to take action on it
A couple million lines of Haskell: Production engineering at Mercury
A dot a day keeps the clutter away
A few words on DS4
A forecast of the fair market value of SpaceX's businesses
A fundamental principle of aeronautical engineering has been overturned
A good AGENTS.md is a model upgrade. A bad one is worse than no docs at all
A grounded conceptual model for ownership types in Rust
A hidden workforce behind Meta’s new smart glasses
A mad undertaking: An undefinitive guide to the Aadam Jacobs collection
A most elegant TCP hole punching algorithm
A network smuggling Starlink tech into Iran to beat internet blackout
A new Bigfoot documentary helps explain our conspiracy-minded era
A new C++ back end for ocamlc
A new account made over $515,000 betting on the U.S. strike against Iran
A new gene therapy is giving people born deaf the chance to hear
A new spam policy for "back button hijacking"
A nicer voltmeter clock
A perfectable programming language
A playable DOOM MCP app
A recent experience with ChatGPT 5.5 Pro
A retro terminal music player inspired by Winamp
A scoping review of bicycling interventions’ impacts on well-being
A sea of sparks: Seeing radioactivity
A sentimental tour of late 1990s and early 2000s hacking tools
A simplified model of Fil-C
A standard protocol to handle and discard low-effort, AI-Generated pull requests
A statement about why RightsCon 2026 will not take place in Zambia
A sufficiently detailed spec is code
A tail-call interpreter in (nightly) Rust
A tale about fixing eBPF spinlock issues in the Linux kernel
A type-safe, realtime collaborative Graph Database in a CRDT
A web-based RDP client built with Go WebAssembly and grdp
A.I. note takers are making lawyers nervous
ABC News has taken all FiveThirtyEight articles offline
ADT says customer data stolen in cyber intrusion
AGPLv3§74 Empowers Users to Thwart Badgeware Like OnlyOffice
AI Error May Have Contributed to Girl's School Bombing in Iran
AI Is Breaking Two Vulnerability Cultures
AI Made Writing Code Easier. It Made Being an Engineer Harder
AI Product Graveyard
AI Skills as loader spec, not prompts – why the architecture changes everything
AI Slop Is Killing Online Communities
AI Tokens Are Mana
AI Tools Are Only as Good as Your Judgment – and That's the Point
AI and bots have officially taken over the internet
AI assistance when contributing to the Linux kernel
AI boom risks widening wealth divide, says BlackRock's Larry Fink
AI coding is gambling
AI could be the end of the digital wave, not the next big thing
AI cybersecurity is not proof of work
AI for American-Produced Cement and Concrete
AI got the blame for the Iran school bombing. The truth is more worrying
AI is a technology not a product
AI is just unauthorised plagiarism at a bigger scale
AI is making junior devs useless
AI is making me dumb
AI is wiping out entry-level jobs
AI may be making us think and write more alike
AI overly affirms users asking for personal advice
AI should elevate your thinking, not replace it
AI, Intimacy, and the Data You Never Meant to Share
AI-Assisted Cognition Endangers Human Development
AIs can't stop recommending nuclear strikes in war game simulations
AMD Am386 released March 2, 1991
AMD pulls a bait-and-switch on Linux users with Vivado licensing changes
AMD's Ryzen 9 9950X3D2 Dual Edition crams 208MB of cache into a single chip
APL is more French than English
ARM AGI CPU: Specs and SKUs
ASCII and Unicode quotation marks (2007)
AWS Engineer Reports PostgreSQL Perf Halved by Linux 7.0, Fix May Not Be Easy
AWS Fired the One Employee Who Gave a Damn
AWS Middle East Central Down, apparently struck in war
AWS stops billing Middle East cloud customers as repairs to war damage drag on
About 10% of AMC movie showings sell zero tickets. This site finds them
About LLMs at Zig Days
Academic fraud may be the symptom of a more systemic problem
Accelerando (2005)
Accelerate
Accelerating Gemma 4: faster inference with multi-token prediction drafters
Access to frontier AI will soon be limited by economic and security constraints
Ada 2022
Ada, Its Design, and the Language That Built the Languages
Adaptional (YC S25) Is Hiring Founding AI Engineers
Addressing Antigravity Bans and Reinstating Access
Adobe modifies hosts file to detect whether Creative Cloud is installed
Ads on Apple Maps
Advanced Mac Substitute is an API-level reimplementation of 1980s-era Mac OS
Advanced Quantization Algorithm for LLMs
Advice to Young People, the Lies I Tell Myself (2024)
Afrika Bambaataa, hip-hop pioneer, has died
Afroman Wins Civil Trial over Use of Police Raid Footage in His Music Videos
Afroman found not liable in defamation case brought by Ohio cops who raided home
After 20 years I turned off Google Adsense for my websites
After 20 years I turned off Google Adsense for my websites (2025)
After dissing Anthropic for limiting Mythos, OpenAI restricts access to Cyber
Ageless Linux – Software for humans of indeterminate age
Agent Reading Test
Agent Safehouse – macOS-native sandboxing for local agents
Agent Skills
Agent Skills – Open Security Database
Agent-harness-kit scaffolding for multi-agent workflows (MCP, provider-agnostic)
Agent-to-agent pair programming
Agentic Coding Is a Trap
Agentic Engineering Patterns
Agentic Patterns
Agents Aren't Coworkers, Embed Them in Your Software
Agents can now create Cloudflare accounts, buy domains, and deploy
Agents need control flow, not more prompts
Agents that run while I sleep
Agora-1: The Multi-Agent World Model
AirSnitch: Demystifying and breaking client isolation in Wi-Fi networks [pdf]
Airbus is preparing two uncrewed combat aircraft
Airline worker arrested after sharing photos of bomb damage in WhatsApp group
Alaska's oil revival sparks a new energy rush Into the Arctic
Alberta startup sells no-tech tractors for half price
Alexander Grothendieck Revolutionized 20th-Century Mathematics
Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment
Alignment whack-a-mole: Finetuning activates recall of copyrighted books in LLMs
All 12 moonwalkers had "lunar hay fever" from dust smelling like gunpowder
All 12 moonwalkers had "lunar hay fever" from dust smelling like gunpowder (2018)
All Four Sentinel-1 Satellites Are Now Live
All Those A.I. Note Takers? They're Making Lawyers Nervous
All elementary functions from a single binary operator
All my clients wanted a carousel, now it's an AI chatbot
All of human cooking compressed into 2 megabytes
All phones sold in the EU to have replaceable batteries from 2027
Allegations of insider trading over prediction-market bets tied to Iran conflict
Allocating on the Stack
Alpha Micro AM-1000E and AM-1200
Alzheimer's disease mortality among taxi and ambulance drivers (2024)
Am I German or Autistic?
Am I the only one who hates delivery robots?
Amazon Busted for Widespread Scheme to Inflate Prices Across the Economy
Amazon Web Services – Four Years and Out
Amazon accused of widespread scheme to inflate prices across the economy
Amazon holds engineering meeting following AI-related outages
Amazon to acquire Globalstar and expand Amazon Leo satellite network
Amazon workers under pressure to up their AI usage–so they're making up tasks
Amazon, Facebook, FBI have access to a private intelligence-sharing network
America's Geothermal Breakthrough
America's Greatest Strategic Blunder: The Imprisonment of Qian Xuesen
America, and probably the world, stands on a precipice
Amiga Graphics
An AI agent deleted our production database. The agent's confession is below
An AI coding agent, used to write code, needs to reduce your maintenance costs
An Interactive Intro to CRDTs (2023)
An Interesting Find: STM32 RDP1 Decryptor
An Introduction to Meshtastic
An NSFW filter for Marginalia search
An Ode to Bzip
An OpenAI model has disproved a central conjecture in discrete geometry
An Update on GitHub Availability
An australian teen team is making radio astronomy affordable for rural schools
An autopsy of AI-generated 3D slop
An experiment to use GitHub Actions as a control plane for a PaaS
An interactive intro to Elliptic Curve Cryptography
An interactive map of Flock Cams
An old photo of a large BBS (2022)
An open-source 240-antenna array to bounce signals off the Moon
An opinionated take on how to do important research that matters
An unknown Sega Saturn project has come to light after 29 years
An unstoppable mushroom is tearing through North American forests
An update on Steam / GOG changes for OpenTTD
An update on recent Claude Code quality reports
Analyzing Geekbench 6 under Intel's BOT
Anatomy of the .claude/ folder
Ancient DNA reveals pervasive directional selection across West Eurasia [pdf]
Andrej Karpathy Joins Anthropic
Android 15's hidden Linux Terminal is a real Debian VM – and it runs Claude Code
Android Developer Verification
Android developer verification: Balancing openness and choice with safety
Android now stops you sharing your location in photos
Android: Balancing Openness and Choice with Safety
Animation 10k Starlink Satellites
Anna's Archive Hit with $19.5M Default Judgment and Global Domain Takedown Order
AnswerThis (YC F25) Is Hiring
Anthropic CEO calls OpenAI's messaging around military deal 'straight up lies'
Anthropic Cofounder Chris Olah's Remarks on Pope Leo XIV's "Magnifica Humanitas"
Anthropic Cowork feature creates 10GB VM bundle on macOS without warning
Anthropic Drops Flagship Safety Pledge
Anthropic Is Preparing for IPO and We Should Be Worried
Anthropic Subprocessor Changes
Anthropic co-founder to present AI encyclical alongside Pope Leo XIV
Anthropic ditches its core safety promise
Anthropic expands partnership with Google and Broadcom for next-gen compute
Anthropic is expanding to Colossus2. Will use GB200
Anthropic raises $65B in Series H funding at $965B post-money valuation
Anthropic says OpenClaw-style Claude CLI usage is allowed again
Anthropic says company 'cannot in good conscience accede' to Pentagon's demands
Anthropic takes $5B from Amazon and pledges $100B in cloud spending in return
Anthropic's "Profitability" Swindle
Anthropic, please make a new Slack
Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark
Ape Coding
Ape Coding [fiction]
Apex Protocol – An open MCP-based standard for AI agent trading
Apideck CLI – An AI-agent interface with much lower context consumption than MCP
Apparently Google hates us now
Apple AI servers unused in warehouses due to low Apple Intelligence usage
Apple App Store threatened to remove Grok over deepfakes: Letter
Apple CMF (Color-Matching Functions) 2026
Apple Needs to Copy Samsung's New Security Smartphone Screen ASAP
Apple Says Mac Studio and Mac Mini Will Be in Short Supply for Months
Apple Silicon and Virtual Machines: Beating the 2 VM Limit (2023)
Apple Silicon costs more than OpenRouter
Apple accelerates eco progress with highest-ever recycled materials
Apple accidentally left Claude.md files Apple Support app
Apple approves driver that lets Nvidia eGPUs work with Arm Macs
Apple discontinues the Mac Pro
Apple fixes bug that cops used to extract deleted chat messages from iPhones
Apple ignores DMA interoperability requests and contradicts own documentation
Apple introduces the new iPad Air, powered by M4
Apple randomly closes bug reports unless you "verify" the bug remains unfixed
Apple says no one using Lockdown Mode has been hacked with spyware
Apple's 512GB Mac Studio vanishes, a quiet acknowledgment of the RAM shortage
Apple's MacBook Neo makes repairs easier and cheaper than other MacBooks
Apple's accidental moat: How the "AI Loser" may end up winning
Apple, Intel have reached preliminary chip-making deal
Apple: Embarrassingly Simple Self-Distillation Improves Code Generation
Approximating Hyperbolic Tangent
April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini
ArXiv Declares Independence from Cornell
Arc Prize Foundation (YC W26) Is Hiring a Platform Engineer for ARC-AGI-4
Arch Linux Now Has a Bit-for-Bit Reproducible Docker Image
Are LLMs not getting better?
Are the Mysteries of Quantum Mechanics Beginning to Dissolve?
Arena AI Model ELO History
Arm AGI CPU
Arm's Cortex X925: Reaching Desktop Performance
Ars Technica Fires Reporter After AI Controversy Involving Fabricated Quotes
Ars Technica fires reporter after AI controversy involving fabricated quotes
Ars Technica: Our newsroom AI policy
Art Bits from HyperCard
Artemis II Fault Tolerance
Artemis II and the invisible hazard on the way to the Moon
Artemis II is not safe to fly
Artemis II safely splashes down
Artificial-life: A simple (300 lines of code) reproduction of Computational Life
Asahi Linux Progress Linux 7.0
Ascending into the Realm of Japanese Charts
Ashby (YC W19) Is Hiring Engineers Who Make Product Decisions
Ask ChatGPT to pick a number from 1-10000, it generally selects from 7200-7500
Ask HN: Academic study on AI's impact on software development – want to join?
Ask HN: Apple terminated our dev account over a rogue employee
Ask HN: European Tech Alternatives?
Ask HN: Have top AI research institutions just given up on the idea of safety?
Ask HN: How are you all staying sane?
Ask HN: How did you land your first projects as a solo engineer/consultant?
Ask HN: How do you handle marketing as a solo technical founder?
Ask HN: How is AI-assisted coding going for you professionally?
Ask HN: How to Be Alone?
Ask HN: How to be SOC2 Type 2 compliant as a solo-entreprenuer?
Ask HN: Is anyone working at least 4 hours daily on an Apple Vision Pro?
Ask HN: My ISP is telling my neighbors their slow internet is because of me
Ask HN: Please restrict new accounts from posting
Ask HN: Remember Fidonet?
Ask HN: Share your productive usage of OpenClaw
Ask HN: What Are You Working On? (March 2026)
Ask HN: What are you working on? (May 2026)
Ask HN: Who is hiring? (March 2026)
Ask.com has closed
Aspartame is not that bad?
Assessing Claude Mythos Preview's cybersecurity capabilities
Astra: An open-source observatory control software
Astronomers Find the Edge of the Milky Way
Astronomers find the edge of the Milky Way
Async Programming Is Just Inject Time
Async Rust never left the MVP state
At Protocol: Building the Social Internet
At least 10 people tied to sensitive US research have died or disappeared
At long last, InfoWars is ours
Atlassian Enables Default Data Collection to Train AI
Atlassian defends firing engineer for suggesting CEO is 'rich jerk'
Atlassian to cut roughly 1,600 jobs in pivot to AI
Atomic Display Switching: Solving
Attie.ai
Attorney General Pam Bondi Out at DOJ
Attractive students no longer receive better results as classes moved online
Attyx – tiny and fast GPU-accelerated terminal emulator written in Zig
Atuin v18.13 – better search, a PTY proxy, and AI for your shell
Austin’s surge of new housing construction drove down rents
AutoKernel: Autoresearch for GPU Kernels
Autoresearch for SAT Solvers
Autoresearch: Agents researching on single-GPU nanochat training automatically
Avoiding Trigonometry (2013)
Avoiding and reducing microplastic false positives from dry glove contact
Axios compromised on NPM – Malicious versions drop remote access trojan
AyaFlow: A high-performance, eBPF-based network traffic analyzer written in Rust
B-trees and database indexes (2024)
BBEdit 16
BMW Group to deploy humanoid robots in production in Germany for the first time
BYD overtakes Tesla and Kia as the best-selling EV brand in key overseas markets
BYD's bet on EVs is paying off as drivers ditch gas amid rising oil prices
BYOMesh – New LoRa mesh radio offers 100x the bandwidth
Backblaze has stopped backing up your data
Bacteria found in the human intestine capable of improving muscle strength
BambuStudio has been violating PrusaSlicer AGPL license since their fork
Bankruptcies increase 11.9 percent
Banned in California
Bars close and hundreds lose jobs as US firm buys Brewdog in £33M deal
Battle for Wesnoth: open-source, turn-based strategy game
Bcachefs creator insists his custom LLM is female and 'fully conscious'
Be Alexandra Elbakyan
Be intentional about how AI changes your codebase
Becoming a father shrinks your cerebrum
Before GitHub
Behavior-Oriented Concurrency for Python
Belgium stops decommissioning nuclear power plants
Benedict Evans: AI eats the world (Spring 26) [pdf]
Bet on German Train Delays
Better JIT for Postgres
Beyond Semantic Similarity
Beyond has dropped “meat” from its name and expanded its high-protein drink line
Biff is a command line datetime Swiss army knife
Big Breakfast Alters Appetite, Gut Health
Big Data on the Cheapest MacBook
Big tech's anti-labor playbook has come for Wikipedia
Bild AI (YC W25) Is Hiring Founding Product Engineers
Bild AI (YC W25) Is Hiring Interns to Make Housing Affordable
Bild AI (YC W25) Is Hiring a Founding Product Engineer
Billion-Parameter Theories
Binary GCD
Biology is a Burrito: A text- and visual-based journey through a living cell
Biscuit
Bitburner, programming-based incremental game
Bitcoin and quantum computing
Bitcoin miners are losing $19,000 on every BTC produced as difficulty drops 7.8%
Bitmap fonts make computers feel like computers again
Bitwarden scrubs 'Always free' and 'Inclusion' values from its site
Bjarne Stroustrup: How do I deal with memory leaks? (2022)
Blacksky AppView
Blaise – A modern self-hosting zero-legacy Object Pascal compiler targeting QBE
Block spent $68M on a single party in September 2025
Block the "Upgrade to Tahoe" Alerts
Block the “Upgrade to Tahoe” Alerts
Blocking Internet Archive Won't Stop AI, but Will Erase Web's Historical Record
Blog ran on Ubuntu 16.04 for 10 years. I migrated it to FreeBSD
Blood test boosts Alzheimer's diagnosis accuracy to 94.5%, clinical study shows
Bloom (YC P26) Is Hiring
Blue Origin's New Glenn blows up during static fire test
Bluesky CEO Jay Graber is stepping down
Bluesky has been dealing with a DDoS attack for nearly a full day
Bombarding gamblers with offers greatly increases betting and gambling harm
Books Are Not Remotely Too Expensive
Bootc and OSTree: Modernizing Linux System Deployment
Borrow-checking without type-checking
Boss-CSS: I created another "CSS-in-JS" lib
Bouncer: Block "crypto", "rage politics", and more from your X feed using AI
Boy I was wrong about the Fediverse
Brazil's Pix Payment System Faces Pressure from Visa and Mastercard
Breaking Down 50M Pins: A Smarter Way to Design 3D IC Packages
Breaking Free
Breakthroughs for batteries could soon make them better
Bricks and Minifigs Stole a Man's $200k Lego Collection
Bring Back Idiomatic Design
Bring your own Agent to MS Teams
Bringing Chrome to ARM64 Linux Devices
Britain is ejecting hereditary nobles from Parliament after 700 years
Britain today generating 90%+ of electricity from renewables
British Columbia to end time changes, adopt year-round daylight time
Bubble Sorted Amen Break
Bucketsquatting is (finally) dead
Buckle Up for Bumpier Skies
Bugs Rust won't catch
Build your own Dial-up ISP with a Raspberry Pi
BuildKit: Docker's Hidden Gem That Can Build Almost Anything
Building Better Country Selects
Building Pi with Pi
Building a Blog with Elixir and Phoenix
Building a JavaScript runtime in one month
Building a Minimal Transformer for 10-digit Addition
Building a Procedural Hex Map with Wave Function Collapse
Building a SaaS in 2026 Using Only EU Infrastructure
Building a Shell
Building a UMatrix Replacement
Building a Z-Machine in the worst possible language – Whitebeard's Realm
Building a new Flash
Building an E2E Encrypted Chat Application with LanceDB and Libsodium
Building an FPGA 3dfx Voodoo with Modern RTL Tools
Building for the Future
Building the TD4 4-Bit CPU
Bun is being ported from Zig to Rust
Bun's unreleased Rust port has 13,365 unsafe blocks
Bun: cgroup-aware AvailableParallelism / HardwareConcurrency on Linux
BunnyCDN has been silently losing our production files for 15 months
Bus stop balancing is fast, cheap, and effective
Butterflies are in decline across North America, a look at the Western Monarch
Byrne's Euclid
C# strings silently kill your SQL Server indexes in Dapper
C++26 is done ISO C++ standards meeting, Trip Report
C, Just In Time!
C64 Basic: Game Map Overhead "Camera View"
CATL's new LFP battery can charge from 10 to 98% in less than 7 minutes
CBP Directive 3340-049B: Border Search of Electronic Devices
CBP updated its electronic device search directive in Jan 2026
CC-Canary: Detect early signs of regressions in Claude Code
CC-Wiki: Turn Claude Code sessions into a shareable knowledge base wiki
CERN levels up with new superconducting karts
CERN to host a new phase of Open Research Europe
CERN uses tiny AI models burned into silicon for real-time LHC data filtering
CERT is releasing six CVEs for serious security vulnerabilities in dnsmasq
CISA Admin Leaked AWS GovCloud Keys on GitHub
CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs
CPanel and WHM Authentication Bypass – CVE-2026-41940
CPanel's Black Week: 3 New Vulnerabilities Patched After Attack on 44k Servers
CSP for Pentesters: Understanding the Fundamentals
CSS as a Query Language
CSS is DOOMed
CUDA-oxide: Nvidia's official Rust to CUDA compiler
CVE-2026-28952: Apple macOS 26.5 Kernel Vuln found by Claude
CVE-2026-31431: Copy Fail vs. rootless containers
CVE-2026-3888: Important Snap Flaw Enables Local Privilege Escalation to Root
Cal.diy: open-source community edition of cal.com
California bill would require patches or refunds when online games shut down
California farmers to destroy 420k peach trees following Del Monte bankruptcy
California ghost-gun bill wants 3D printers to play cop, EFF says
California moves to exempt Linux from its age-verification law after backlash
California to begin ticketing driverless cars that violate traffic laws
California's Battery Array Is as Powerful as 12 Nuclear Power Plants
California's Digital Age Assurance Act, and FOSS
Cambodia unveils a statue of famous landmine-sniffing rat Magawa
Can Claude Fly a Plane?
Can I disable all data collection from my vehicle?
Can a wealthy family change the course of a deadly brain disease?
Can we code our way out of gentrification?
Can we have the day off?
Can you instruct a robot to make a PBJ sandwich?
Can you stop beans from making you gassy?
Canada to order military plane fleet from Sweden in shift from US suppliers
Canada's bill C-22 mandates mass metadata surveillance
Canada's bill C-22 mandates mass metadata surveillance of Canadians
Canada’s Bill C-22 Is a Repackaged Version of Last Year’s Surveillance Nightmare
Cancel ChatGPT AI boycott surges after OpenAI pentagon military deal
Cannabinoids remove plaque-forming Alzheimer's proteins from brain cells
Cannabinoids remove plaque-forming Alzheimer's proteins from brain cells (2016)
Canonical Under Attack
Canvas (Instructure) LMS Down in Ongoing Ransomware Attack
Canvas is down as ShinyHunters threatens to leak schools’ data
Capability-Based Security for Redox: Namespace and CWD as Capabilities
Capybara: A Unified Visual Creation Model
Carbon dioxide overload in human blood suggests a toxic atmosphere in 50 years
Cardiorespiratory fitness is associated with lower anger and anxiety
Care homes and hotels in Japan shut as expansion strategy unravels
Carrot Disclosure: Forgejo
Cars collect a startling amount of data about you
Cartoon Network Flash Games
CasNum
Case study: recovery of a corrupted 12 TB multi-device pool
Cash Issuing Terminals
Cash issuing terminals
Casus Belli Engineering
Cat (YC S22) Seeks Fractional Engineer to Build AI-Native Growth Toolkit
Category Theory Illustrated – Orders
Cedana (YC S23) Is Hiring
Cekura (YC F24) Is Hiring
Celebrating Tony Hoare's mark on computer science
Cell Service for the Fairly Paranoid
Ceno, browse the web without internet access
Changes in the system prompt between Claude Opus 4.6 and 4.7
Changes to OpenTTD Distribution on Steam
Chaos and Dystopian news for the dead internet survivors
Charcuterie – Visual similarity Unicode explorer
ChatGPT Images 2.0
ChatGPT Pro now starts at $100/month
ChatGPT serves ads. Here's the full attribution loop
ChatGPT won't let you type until Cloudflare reads your React state
Chemistry behind the Garden Grove chemical tank
Chess Invariants
Chest Fridge (2009)
Chewing gum restores dad's taste and smell years after Covid
Chicago artist creates tourism posters for city's neighborhoods
Childhood Computing
Chimpanzees in Uganda locked in eight-year 'civil war', say researchers
Chimpanzees in Uganda locked in vicious 'civil war', say researchers
China's 450kmph bullet train is the fastest ever built
Chrome removes claim of On-device Al not sending data to Google Servers
Chrome's AI features may be hogging 4GB of your computer storage
Chuck Norris has died
Circle Medical (YC S15) Is Hiring a Mobile Engineer
Circuit-level PDP-11/34 emulator
Cirrus Labs to join OpenAI shut down Circus CI on Monday, June 1, 2026
Cisco workforce reductions
City Learns Flock Accessed Cameras in Children's Gymnastics Room as a Sales Demo
Clarification on the Notepad++ Trademark Issue
Claude Account Suspended Seconds After Purchase?
Claude Brain
Claude Code Cheat Sheet
Claude Code Found a Linux Vulnerability Hidden for 23 Years
Claude Code LSP
Claude Code Remote Control
Claude Code Unpacked : A visual guide
Claude Code as a Daily Driver: Claude.md, Skills, Subagents, Plugins, and MCPs
Claude Code conducts A/B tests on core features
Claude Code runs Git reset –hard origin/main against project repo every 10 mins
Claude Code to be removed from Anthropic's Pro plan?
Claude Code wiped our production database with a Terraform command
Claude Code – Everything You Can Configure That the Docs Don't Tell You
Claude Code, Claude Cowork and Codex #5
Claude Code: Channels
Claude Is Not Your Architect. Stop Letting It Pretend
Claude Managed Agents
Claude Opus 4.7
Claude Opus 4.7 Model Card
Claude Opus 4.7 costs 20–30% more per session
Claude Platform on AWS
Claude Token Counter, now with model comparisons
Claude Wrote a Full FreeBSD Remote Kernel RCE with Root Shell (CVE-2026-4747)
Claude for Creative Work
Claude for Small Business
Claude mixes up who said what and that's not OK
Claude now creates interactive charts, diagrams and visualizations
Claude struggles to cope with ChatGPT exodus
Claude system prompt bug wastes user money and bricks managed agents
Claude.ai Down Again?
Claude.ai and API Unavailable
Claude.ai unavailable and elevated errors on the API
ClawIRC – IRC Chat for Agents
ClawRun – Deploy and manage AI agents in seconds
Clay PCB Tutorial
Cleve Moler has died
Click (2016)
Clockwise acquired by Salesforce and shutting down next week
ClojureScript Gets Async/Await
Clojurists Together – Q2 2026 Open Source Funding Announcement
Closure of the Weatheradio Service in Canada
Closure of the Weatheradio service in Canada
Closure of the Weatherradio Service in Canada
Cloud VM benchmarks 2026
Cloudflare Crawl Endpoint
Cloudflare Email Service: now in public beta. Ready for your agents
Cloudflare Flagship
Cloudflare crawl endpoint
Cloudflare flags archive.today as "C&C/Botnet"; no longer resolves via 1.1.1.2
Cloudflare's AI Platform: an inference layer designed for agents
Cockpit is a web-based graphical interface for servers
Cocoa-Way – Native macOS Wayland compositor for running Linux apps seamlessly
Codex Hacked a Samsung TV
Codex for almost everything
Codex pricing to align with API token usage, instead of per-message
Codex-maxxing
Coding Agents Could Make Free Software Matter Again
CodingFont: A game to help you pick a coding font
Coffee with a splash of physics: how to make the most out of your brew
Cognitive Debt: When Velocity Exceeds Comprehension
Cohere Transcribe: Speech Recognition
Coldkey – Post-quantum age key generation and paper backup tool
Colibri – chat platform built on the AT Protocol for communities big and small
CollectWise (YC F24) Is Hiring
College instructor turns to typewriters to curb AI-written work
College students drown out AI-praising commencement speeches with boos
Colombia hosts talks on exiting fossil fuels as global energy crisis deepens
Colonization of Venus
Colorado Adds Open-Source Exemption to Age-Verification Bill
Colorado House passes bill to limit surveillance pricing and wage setting
Colored Shadow Penumbra
Columnar Storage Is Normalization
Commission fines Temu €200M for breaching the Digital Services Act
Common Lisp Development Tooling
Common drug tests lead to tens of thousands wrongful arrests a year
Composition Shouldn't be this Hard
Compound drivers of Antarctic sea ice loss and Southern Ocean destratification
Computational Physics (2nd Edition)
Computer Hobby Movement in Canada
Computer Use is 45x more expensive than structured APIs
Computer chip material inspired by the human brain could slash AI energy use
Computer-generated dream world: Virtual reality for a 286 processor
Connecticut and the 1 Kilometer Effect
Constraint Decay: The Fragility of LLM Agents in Back End Code Generation
Contextual commits – An open standard for capturing the why in Git history
Converge (YC S23) Is Hiring a Founding Platform Engineer (NYC, Onsite)
Conway's Game of Life, in real life
Cook: A simple CLI for orchestrating Claude Code
Copilot edited an ad into my PR
Copy Fail – CVE-2026-31431
CopyFail was not disclosed to Gentoo developer
Corgi Labs (YC W23) Is Hiring
Corruption erodes social trust more in democracies than in autocracies
Cosmology with Geometry Nodes
Cost of enum-to-string: C++26 reflection vs. the old ways
Country that put backdoors in Cisco routers to spy on world bans foreign routers
Coursera and Udemy are now one company
Craig Venter has died
Craig Venter of Human Genome Project Dies at 79
Create an MP4 video of a web page scrolling at a steady speed
Create value for others and don’t worry about the returns
Credit cards are vulnerable to brute force kind attacks
Croatia declared free of landmines after 31 years
Cronboard: A terminal-based dashboard for managing cron jobs
Cross-Model Void Convergence: GPT-5.2 and Claude Opus 4.6 Deterministic Silence
Cursing the government does not fix potholes. Spray-painting them does
Cursor 3
Cursor Composer 2 is just Kimi K2.5 with RL
Customer Update on Simplenote
Cutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpoint
Cyber.mil serving file downloads using TLS certificate which expired 3 days ago
Cyberattack on vehicle breathalyzer company leaves drivers stranded in the US
DARPA's new X-76 Experimental Plane
DHS Contracts Explorer – Hacked data from the Office of Industry Partnership
DHS Quits Granting Green Cards–Almost
DIY Soft Drinks
DMCA-resistant Claude Code source code
DOJ confirms FBI Director Kash Patel's personal email was hacked
DOS Memory Management
DOS Zone
DRAM pricing is killing the hobbyist SBC market
DaVinci Resolve releases Photo Editor
Dad brains: How fatherhood rewires the male mind
Daily Driving GrapheneOS
Dan Simmons, author of Hyperion, Song of Kali, dead at 77
Dan Simmons, author of Hyperion, has died
Danish Gov agency to ditch Microsoft software in push for digital independence
Danish government agency to ditch Microsoft software (2025)
Dario Amodei calls OpenAI’s messaging around military deal ‘straight up lies’
Dark Castle
Darkbloom – Private inference on idle Macs
Data Has Weight but Only on SSDs
DataCenter.FM – background noise app featuring the sound of the AI bubble
Dataframe 1.0.0.0
Datasets for Reconstructing Visual Perception from Brain Data
Dav2d
Dead.Letter (CVE-2026-45185) – How XBOW found an unauthenticated RCE on Exim
Dear Time Lords: Freeze Computers in 1993
Debian decides not to decide on AI-generated contributions
Debian must ship reproducible packages
Debunking Zswap and Zram Myths
Decimal-Java is a library to convert java.math.BigDecimal to and from IEEE-754r
Decision trees – the unreasonable power of nested decision rules
Decoupled DiLoCo: Resilient, Distributed AI Training at Scale
DeepClaude – Claude Code agent loop with DeepSeek V4 Pro
DeepClaude – Claude Code agent loop with DeepSeek V4 Pro, 17x cheaper
DeepSeek 4 Flash local inference engine for Metal
DeepSeek reasonix, DeepSeek native coding agent with high caching and low cost
DeepSeek to Make Permanent 75% Discount on Flagship AI Model
DeepSeek v4
DeepSeek-V4-Flash means LLM steering is interesting again
Deezer says 44% of songs uploaded to its platform daily are AI-generated
Defeat as Method
Defeating Git Rigour Fatigue with Jujutsu
Delphi 13.1 Released, with ARM64 support
Delve removed from Y Combinator
Delve sets the record straight on anonymous attacks
Democracy in 2025: on rising authoritarianism in the United States
Denmark was reportedly preparing for full-scale war with the US over Greenland
Denver dumps Flock, awards contract to Axon
Department of War Designates Anthropic Supply Chain Risk
Dependency cooldowns turn you into a free-rider
Design posters showcasing your country's electrical grid
Desk for people who work at home with a cat
Details of the Daring Airdrop at Tristan Da Cunha
Detecting DOSBox from Within the Box
Deterministic Fully-Static Whole-Binary Translation Without Heuristics
Devirtualization and Static Polymorphism
Diatec, known for its mechanical keyboard brand FILCO, has ceased operations
Digg is gone again
Digg.com Closing Due to Spam
Digging into Drama at the Document Foundation
Digital Identity Management in Norway Is a Catastrophe
DigitalOcean Seeks $800M in Funding
Dillo Browser Release 3.3.0
Dirtyfrag: Universal Linux LPE
Discontinuation and reinitiation of dual-labeled GLP-1 receptor agonists
Discord cuts ties with Peter Thiel-backed verification software
Discourse Is Not Going Closed Source
Discret 11, the French TV encryption of the 80s
Diskless Linux boot using ZFS, iSCSI and PXE
Disney erased FiveThirtyEight
Distributed DuckDB Instance
Distributing Mac software is increasing my cortisol levels
Do AI Agents Make Money in 2026? Or Is It Just Mac Minis and Vibes?
Do Not Turn Child Protection into Internet Access Control
Do You Even Need a Database?
Do_not_track
Does Gas Town 'steal' usage from users' LLM credits to improve itself?
Does Postgres Scale?
Does anybody like React?
Does coding with LLMs mean more microservices?
Does that use a lot of energy?
Dolphin Progress Release 2603
Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems
Don't Make Me Talk to Your Chatbot
Don't Outsource the Learning
Don't Roll Your Own
Don't become an engineering manager
Don't know where your data is from? Bayesian modeling for unknown coordinates
Don't post generated/AI-edited comments. HN is for conversation between humans.
Don't run OpenClaw on your main machine
Don't trust AI agents
Don't use passkeys for encrypting user data
Dontsurveil.me
Dragon Ball Color Correction Process [pdf]
Dream Recorder AI – a portal to your subconscious
Drop, formerly Massdrop, ends most collaborations and rebrands under Corsair
Dropping Cloudflare for Bunny.net
Drugwars for the TI-82/83/83 Calculators (2011)
Drunk Post: Things I've Learned as a Senior Engineer
Drunk post: Things I've learned as a senior engineer (2021)
Dumb Ways for an Open Source Project to Die
Dumb ways for an open source project to die
Durdraw – ANSI art editor for Unix-like systems
Dutch suicide prevention website shares data with tech companies without consent
DynIP – Dynamic DNS with RFC 2136, IPv6, DNSSEC, and BYOD
Dynamic Workflows in Claude Code
Dyson settles forced labour suit in landmark UK case
ECS Survivors Parts VII – X
EFF is leaving X
EFF to 4th Circuit: Electronic Device Searches at the Border Require a Warrant
EU Age Control: The trojan horse for digital IDs
EU calls VPNs "a loophole that needs closing" in age verification push
EU to crack down on TikTok, Instagram's 'addictive design' targeting kids
EVi, a Hard-Fork of Vim
Eagle 3.1: Collaboration Between the EAGLE Team, vLLM Team, and TorchSpec Team
Earthion: A New Mega Drive-Style Shoot-Em-Up
EasyPost (YC S13) Is Hiring
Echoes (Live at Pompeii)
Ed Zitron loses his mind annotating an AI doomer macro memo
Eden AI – European Alternative to OpenRouter
Education must go beyond the mere production of words
Effort to prevent government officials from engaging in prediction markets
Electrobun 2.0 will be decoupled from Bun due to the rust rewrite
Elevated Errors in Claude.ai
Elevated error rates on Opus 4.7
Elite Overproduction
Elon Musk has lost his lawsuit against Sam Altman and OpenAI
EmDash – a spiritual successor to WordPress that solves plugin security
EmDash: A Fresh Take on CMS
Email obfuscation: What works in 2026?
Emotion concepts and their function in a large language model
Employers use your personal data to figure out the lowest salary you'll accept
Emuko: Fast RISC-V emulator written in Rust, boots Linux
Enabling Codex to Analyze Two Decades of Hacker News Data
End of "Chat Control": EU Parliament Stops Mass Surveillance in Voting Thriller
Eniac, the First General-Purpose Digital Computer, Turns 80
Enough with the AI FOMO, go slow-mo, says Domo CDO
Entomologists use a particle accelerator to image ants at scale
Entso-E final report on Iberian 2025 blackout
Epoch confirms GPT5.4 Pro solved a frontier math open problem
Era: From Nature publication to catalyzing Computational Discovery
Eric Schmidt speech about AI booed during graduation
Erin Brockovich made a map to track data centers around the country
Erlang/OTP 29.0
EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages
Eternity in six hours: Intergalactic spreading of intelligent life (2013)
Ethiopia gets $350M World Bank financing for its digital ID project (2024)
Europe built sovereign clouds to escape US control. Forgot about the processors
Europe has "maybe 6 weeks of jet fuel left"
European Money Pours into Palantir
European Parliament decided that Chat Control 1.0 must stop
European governments: 3.000 tracking sites, 1.000 phpMyAdmins, and 99% poorly
Evaluating Spec CPU2026
Evaluation of Claude Mythos Preview's cyber capabilities
EvanFlow – A TDD driven feedback loop for Claude Code
Even "cat readme.txt" is not safe
Even 'uncensored' models can't say what they want
Event Horizon Labs (YC W24) Is Hiring
Everett shuts down Flock camera network after judge rules footage public record
Every AI Subscription Is a Ticking Time Bomb for Enterprise
Every Frontier AI Is INTJ
Every Law a Commit – US Law in GitHub
Every layer of review makes you 10x slower
Everything Changes, and Nothing Changes
Everything in C is undefined behavior
Everything we like is a psyop
Everything we like is a psyop?
Evolving descriptive text of mental content from human brain activity
Ex-Apple engineer says Apple deliberately slows older phones via updates
Ex-CEO, ex-CFO of bankrupt AI company charged with fraud
Excel incorrectly assumes that the year 1900 is a leap year
Exit IP VPN servers mitigation rollout
Expanding Swift's IDE Support
Experience: We found a baby on the subway – now he's our 26-year-old son
Experts sound alarm after ChatGPT Health fails to recognise medical emergencies
Exposing Critical Vulnerabilities in CBSE's On-Screen Marking Portal
Exposing Floating Point – Bartosz Ciechanowski
Exposing Floating Point – Bartosz Ciechanowski (2019)
Extending single-minus amplitudes to gravitons
Extra usage credit for Claude to celebrate usage bundles launch (Pro, Max, Team)
Extremely Low Frequencies
F-15E jet shot down over Iran
F-35 is built for the wrong war
F-Droid Board of Directors nominations 2026
FBI Arrests CIA Official with $40M in Gold Bars in His Home
FBI director's Based Apparel site has been spotted hosting a 'ClickFix' attack
FBI is buying location data to track US citizens, director confirms
FBI looks into dead or missing scientists tied to NASA, Blue Origin, SpaceX
FBI used iPhone notification data to retrieve deleted Signal messages
FCC Updates Covered List to Include Foreign-Made Consumer Routers
FCC chairman threatens TV broadcast licenses over news coverage
FCC updates covered list to include foreign-made consumer routers
FDA Approves First-Ever Gene Therapy for Treatment of Genetic Hearing Loss
FFmpeg 101 (2024)
FFmpeg 8.1
FFmpeg-over-IP – Connect to remote FFmpeg servers
FIM – Linux framebuffer image viewer
FSF trying to contact Google about spammer sending 10k+ mails from Gmail account
FTC action against Match and OkCupid for deceiving users, sharing personal data
Fabricked: Misconfiguring Infinity Fabric to Break AMD SEV-SNP
Factory Logic
Fake Fans
False claims in a widely-cited paper
Familiarity is the enemy: On why Enterprise systems have failed for 60 years
Fast16: High-precision software sabotage 5 years before Stuxnet
FastCGI: 30 years old and still the better protocol for reverse proxies
FatGid: FreeBSD 14.x kernel local privilege escalation
Fecal transplants for autism deliver success in clinical trials
Fed's Cook says AI triggering big changes, sees possible unemployment rise
Federal Right to Privacy Act – Draft legislation
Federal data breach may be the biggest hack in US history
Fedware: Government apps that spy harder than the apps they ban
Feedr v0.8.0 – a TUI RSS reader, now read the full article from your terminal
Felix "fx" Lindner has died
Fentanyl makeover: Core structural redesign could lead to safer pain medications
Filing the corners off my MacBooks
Firefox 148 Launches with AI Kill Switch Feature and More Enhancements
Firefox Has Integrated Brave's Adblock Engine
Firm boosts H.264 streaming license fees from $100k up to staggering $4.5M
First MacBook Neo Benchmarks Are In
First Website
First Website (1992)
First Western Digital, now Sony: The tech giant suspends SD card sales
First public macOS kernel memory corruption exploit on Apple M5
First-ever in-utero stem cell therapy for fetal spina bifida repair is safe
Fisker went bankrupt and owners built an open source car company from the ashes
Five Years of Running a Systems Reading Group at Microsoft
Five frontier LLMs disagree on 67% of 1k real-world fact-check claims
FiveThirtyEight articles on the Internet Archive
Fixfest is a global gathering of repairers, tinkerers, and activists
Fixing a 20-year-old bug in Enlightenment E16
Flash-Moe: Running a 397B Parameter Model on a Mac with 48GB RAM
Flick (YC F25) Is Hiring Front End Engineer to Build Figma for AI Filmmaking
Flightradar24 for Ships
Flighty Airports
Flipper One Tech Specs
Flipper One – we need your help
Floci – A free, open-source local AWS emulator
Flock Condemns False Child Predator Allegations, Yet Calls Critics Terrorists
Flock employees caught watching kids gymnastic class and pools
Flow Map Learning via Nongradient Vector Flow [pdf]
Flue is a TypeScript framework for building the next generation of agents
Folk are getting dangerously attached to AI that always tells them they're right
Follow-up to Carrot disclosure: Forgejo
Following 35% growth, solar has passed hydro on US grid
Fontcrafter: Turn Your Handwriting into a Real Font
Footage shows US citizen shot dead by ICE agent in Texas traffic stop
Forget Flags and Scripts: Just Rename the File
Forking the Web
Formal Verification Gates for AI Coding Loops
Formatting a 25M-line codebase overnight
Founder of 7/11 Japan, Toshifumi Suzuki, has died at age 93
Founder of GitLab battles cancer by founding companies
FrameBook
Framework Laptop 13 Pro
Framework Laptop 13 Pro: Major Upgrades and Linux Front and Center
France Launches Government Linux Desktop Plan as Windows Exit Begins
France Moves to Break Encrypted Messaging
France pulls last gold held in US for $15B gain
France's Mistral Built a $14B AI Empire by Not Being American
France's government is ditching Windows for Linux, says US tech a strategic risk
Free Textbook on Engineering Thermodynamics
Free, fast diagnostic tools for DNS, email authentication, and network security
FreeBSD 14.4-Release Announcement
FreeBSD Device Drivers Book
FreeCAD v1.1
Friendica – A Decentralized Social Network
From 0% to 36% on Day 1 of ARC-AGI-3
From Rust to Ruby
From Supabase to Clerk to Better Auth
From birds to brains: My path to the fusiform face area (2024)
Fuck the cloud (2009)
Full Disclosure: A Third (and Fourth) Azure Sign-In Log Bypass Found
Full network of clitoral nerves mapped out for first time
Full-Text Search with DuckDB
Functional programmers need to take a look at Zig
Fungal Electronics (2021)
Further human + AI + proof assistant work on Knuth's "Claude Cycles" problem
FusionCore: ROS 2 sensor fusion (IMU and GPS and encoders)
Futhark by Example
Fyn: An uv fork with new features, bug fixes, stripped telemetry
GCC 16 has been released
GLM-5.1: Towards Long-Horizon Tasks
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
GLiNER2: Unified Schema-Based Information Extraction
GNU IFUNC is the real culprit behind CVE-2024-3094
GNU Texmacs
GPL upgrades via section 14 proxy delegation
GPT 5.4 Thinking and Pro
GPT 5.5 biosafety bounty
GPT Guesses Between 1 and 100
GPT-5.4 Thinking System Card
GPT-5.4 Thinking and GPT-5.4 Pro
GPT-5.5
GPT-5.5 Price Increase: What It Costs
GPT‑5.3 Instant
GPT‑5.4 Mini and Nano
GTFOBins
Gambling ads on social media reach more than twice as many men as women: study
Game about Data of America
GameStop Preparing Offer for eBay
GameStop makes $55.5B takeover offer for eBay
Games with loot boxes to get minimum 16 age rating across Europe
Garnix (A Nix CI) is shutting down
Gas Town: From Clown Show to v1.0
Gaussian Splat of a Strawberry
Gear Commit: Dev gadget box personalized from GitHub activity
Gemini 3.5 Flash
Gemini API File Search is now multimodal
Gemini Omni
Gemini randomly dumped its system prompt
Gemini, Gophers, and Fingers. Oh My Alternative Internets Beyond HTTPS
Gemma 4 on iPhone
GenCAD
Generalised plusequals
Generating All 32-Bit Primes (Part I)
Generating Hierarchical JSON Representations of Scientific Sentences Using LLMs
Generative AI Use and Depressive Symptoms Among US Adults
GeoJSON
George Goble died recently – known for first dual-CPU-Unix and fast BBQ lighting
George Goble has died
George Orwell Predicted the Rise of "AI Slop" in Nineteen Eighty-Four
German Dog Commands

Why SWE-bench Verified no longer measures frontier coding capabilities

🚨 Frontier Coding Capabilities Put to the Test: The SWE-bench Verified Update

guid

source_url

author_name

🚨 Frontier Coding Capabilities Put to the Test: The SWE-bench Verified Update