AI Testing

EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages

1:05 am, March 20, 2026

guid: https://news.ycombinator.com/item?id=47446021
source_url: https://esolang-bench.vercel.app/
author_name: matt_d

id: 991
uid: YUG1K
insdate: 2026-03-20 01:05:23
title: EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages
additional: EsoLang-Bench is a benchmark that assesses the genuine reasoning capabilities of Large Language Models (LLMs) using esoteric programming languages. It evaluates an LLM's ability to understand and apply logical rules in unfamiliar contexts.
category: Hacker News

Category List HNews

"Cancel ChatGPT" movement goes mainstream after OpenAI closes deal with U.S. Dow
"Collaboration" Is Bullshit
"That Shape Had None" – A Horror of Substrate Independence (Short Fiction)
"Warn about PyPy being unmaintained"
"We do not think Anthropic should be designated as a supply chain risk"
$96 3D-printed rocket that recalculates its mid-air trajectory using a $5 sensor
'Miracle': Europe reconnects with lost spacecraft
'The Secret Agent': Exploring a Vibrant, yet Violent Brazil (2025)
'Your Frustration Is the Product'
/e/OS is a complete "deGoogled", mobile ecosystem
10-202: Introduction to Modern AI (CMU)
100M-Row Challenge with PHP
1B identity records exposed in ID verification data leak
1M context is now generally available for Opus 4.6 and Sonnet 4.6
2% of ICML papers desk rejected because the authors used LLM in their reviews
3D-Knitting: The Ultimate Guide
404 Deno CEO not found
90% of crypto's Illinois primary spending failed to achieve its objective
A CPU that runs entirely on GPU
A Decade of Docker Containers
A Decade of Slug
A GitHub Issue Title Compromised 4k Developer Machines
A Japanese Glossary of Chopsticks Faux Pas
A Japanese glossary of chopsticks faux pas (2022)
A Nationwide Book Ban Bill Has Been Introduced in the House of Representatives
A Visual Introduction to Machine Learning
A bit of fluid mechanics from scratch not from scratch
A case for Go as the best language for AI agents
A hidden workforce behind Meta’s new smart glasses
A most elegant TCP hole punching algorithm
A new Bigfoot documentary helps explain our conspiracy-minded era
A new account made over $515,000 betting on the U.S. strike against Iran
A standard protocol to handle and discard low-effort, AI-Generated pull requests
A sufficiently detailed spec is code
A tale about fixing eBPF spinlock issues in the Linux kernel
AI Error May Have Contributed to Girl's School Bombing in Iran
AI Made Writing Code Easier. It Made Being an Engineer Harder
AI coding is gambling
AI is making junior devs useless
AIs can't stop recommending nuclear strikes in war game simulations
AMD Am386 released March 2, 1991
ASCII and Unicode quotation marks (2007)
AWS Middle East Central Down, apparently struck in war
Ada 2022
Addressing Antigravity Bans and Reinstating Access
Afroman Wins Civil Trial over Use of Police Raid Footage in His Music Videos
Afroman found not liable in defamation case brought by Ohio cops who raided home
Ageless Linux – Software for humans of indeterminate age
Agent Safehouse – macOS-native sandboxing for local agents
Agent Skills – Open Security Database
Agentic Engineering Patterns
Agents that run while I sleep
AirSnitch: Demystifying and breaking client isolation in Wi-Fi networks [pdf]
Airbus is preparing two uncrewed combat aircraft
Allegations of insider trading over prediction-market bets tied to Iran conflict
Allocating on the Stack
Alpha Micro AM-1000E and AM-1200
Amazon Busted for Widespread Scheme to Inflate Prices Across the Economy
Amazon accused of widespread scheme to inflate prices across the economy
Amazon holds engineering meeting following AI-related outages
America, and probably the world, stands on a precipice
An Interactive Intro to CRDTs (2023)
An Interesting Find: STM32 RDP1 Decryptor
An Ode to Bzip
An autopsy of AI-generated 3D slop
An experiment to use GitHub Actions as a control plane for a PaaS
An interactive intro to Elliptic Curve Cryptography
An interactive map of Flock Cams
An old photo of a large BBS (2022)
An opinionated take on how to do important research that matters
An update on Steam / GOG changes for OpenTTD
Android developer verification: Balancing openness and choice with safety
Android: Balancing Openness and Choice with Safety
Animation 10k Starlink Satellites
AnswerThis (YC F25) Is Hiring
Anthropic CEO calls OpenAI's messaging around military deal 'straight up lies'
Anthropic Cowork feature creates 10GB VM bundle on macOS without warning
Anthropic Drops Flagship Safety Pledge
Anthropic ditches its core safety promise
Anthropic says company 'cannot in good conscience accede' to Pentagon's demands
Anthropic, please make a new Slack
Ape Coding
Ape Coding [fiction]
Apideck CLI – An AI-agent interface with much lower context consumption than MCP
Apple AI servers unused in warehouses due to low Apple Intelligence usage
Apple Needs to Copy Samsung's New Security Smartphone Screen ASAP
Apple introduces the new iPad Air, powered by M4
Apple's 512GB Mac Studio vanishes, a quiet acknowledgment of the RAM shortage
Apple's MacBook Neo makes repairs easier and cheaper than other MacBooks
ArXiv Declares Independence from Cornell
Are LLMs not getting better?
Are the Mysteries of Quantum Mechanics Beginning to Dissolve?
Arm's Cortex X925: Reaching Desktop Performance
Ars Technica Fires Reporter After AI Controversy Involving Fabricated Quotes
Ars Technica fires reporter after AI controversy involving fabricated quotes
Art Bits from HyperCard
Artificial-life: A simple (300 lines of code) reproduction of Computational Life
Ask ChatGPT to pick a number from 1-10000, it generally selects from 7200-7500
Ask HN: Apple terminated our dev account over a rogue employee
Ask HN: Have top AI research institutions just given up on the idea of safety?
Ask HN: How are you all staying sane?
Ask HN: How is AI-assisted coding going for you professionally?
Ask HN: How to Be Alone?
Ask HN: Please restrict new accounts from posting
Ask HN: Remember Fidonet?
Ask HN: Share your productive usage of OpenClaw
Ask HN: What Are You Working On? (March 2026)
Ask HN: Who is hiring? (March 2026)
Astra: An open-source observatory control software
Async Programming Is Just Inject Time
Atlassian to cut roughly 1,600 jobs in pivot to AI
Attractive students no longer receive better results as classes moved online
Attyx – tiny and fast GPU-accelerated terminal emulator written in Zig
Atuin v18.13 – better search, a PTY proxy, and AI for your shell
Austin’s surge of new housing construction drove down rents
AutoKernel: Autoresearch for GPU Kernels
Autoresearch for SAT Solvers
Autoresearch: Agents researching on single-GPU nanochat training automatically
Avoiding Trigonometry (2013)
BMW Group to deploy humanoid robots in production in Germany for the first time
BYD's bet on EVs is paying off as drivers ditch gas amid rising oil prices
Banned in California
Bars close and hundreds lose jobs as US firm buys Brewdog in £33M deal
Bcachefs creator insists his custom LLM is female and 'fully conscious'
Be intentional about how AI changes your codebase
Bet on German Train Delays
Better JIT for Postgres
Beyond has dropped “meat” from its name and expanded its high-protein drink line
Big Breakfast Alters Appetite, Gut Health
Big Data on the Cheapest MacBook
Bild AI (YC W25) Is Hiring Interns to Make Housing Affordable
Billion-Parameter Theories
Blacksky AppView
Block spent $68M on a single party in September 2025
Block the "Upgrade to Tahoe" Alerts
Block the “Upgrade to Tahoe” Alerts
Blocking Internet Archive Won't Stop AI, but Will Erase Web's Historical Record
Blood test boosts Alzheimer's diagnosis accuracy to 94.5%, clinical study shows
Bluesky CEO Jay Graber is stepping down
Bombarding gamblers with offers greatly increases betting and gambling harm
Bootc and OSTree: Modernizing Linux System Deployment
Boss-CSS: I created another "CSS-in-JS" lib
Boy I was wrong about the Fediverse
Breaking Down 50M Pins: A Smarter Way to Design 3D IC Packages
Breaking Free
Bringing Chrome to ARM64 Linux Devices
Britain is ejecting hereditary nobles from Parliament after 700 years
British Columbia to end time changes, adopt year-round daylight time
Bubble Sorted Amen Break
Bucketsquatting is (finally) dead
Buckle Up for Bumpier Skies
BuildKit: Docker's Hidden Gem That Can Build Almost Anything
Building Better Country Selects
Building a Minimal Transformer for 10-digit Addition
Building a Procedural Hex Map with Wave Function Collapse
Building a Shell
Building a new Flash
Building an FPGA 3dfx Voodoo with Modern RTL Tools
Bus stop balancing is fast, cheap, and effective
C# strings silently kill your SQL Server indexes in Dapper
CSP for Pentesters: Understanding the Fundamentals
CVE-2026-3888: Important Snap Flaw Enables Local Privilege Escalation to Root
California's Digital Age Assurance Act, and FOSS
Can a wealthy family change the course of a deadly brain disease?
Can you instruct a robot to make a PBJ sandwich?
Canada's bill C-22 mandates mass metadata surveillance
Canada's bill C-22 mandates mass metadata surveillance of Canadians
Cancel ChatGPT AI boycott surges after OpenAI pentagon military deal
Cannabinoids remove plaque-forming Alzheimer's proteins from brain cells
Cannabinoids remove plaque-forming Alzheimer's proteins from brain cells (2016)
Capybara: A Unified Visual Creation Model
Carbon dioxide overload in human blood suggests a toxic atmosphere in 50 years
Cardiorespiratory fitness is associated with lower anger and anxiety
CasNum
Cash Issuing Terminals
Cash issuing terminals
Celebrating Tony Hoare's mark on computer science
Cell Service for the Fairly Paranoid
Ceno, browse the web without internet access
Changes to OpenTTD Distribution on Steam
Chaos and Dystopian news for the dead internet survivors
Chest Fridge (2009)
China's 450kmph bullet train is the fastest ever built
Chuck Norris has died
Claude Code LSP
Claude Code Remote Control
Claude Code conducts A/B tests on core features
Claude Code wiped our production database with a Terraform command
Claude Code, Claude Cowork and Codex #5
Claude Code: Channels
Claude now creates interactive charts, diagrams and visualizations
Claude struggles to cope with ChatGPT exodus
Clockwise acquired by Salesforce and shutting down next week
Closure of the Weatheradio Service in Canada
Closure of the Weatheradio service in Canada
Closure of the Weatherradio Service in Canada
Cloud VM benchmarks 2026
Cloudflare Crawl Endpoint
Cloudflare crawl endpoint
Cloudflare flags archive.today as "C&C/Botnet"; no longer resolves via 1.1.1.2
Cockpit is a web-based graphical interface for servers
Cognitive Debt: When Velocity Exceeds Comprehension
Common Lisp Development Tooling
Computer-generated dream world: Virtual reality for a 286 processor
Connecticut and the 1 Kilometer Effect
Contextual commits – An open standard for capturing the why in Git history
Converge (YC S23) Is Hiring a Founding Platform Engineer (NYC, Onsite)
Conway's Game of Life, in real life
Cook: A simple CLI for orchestrating Claude Code
Corgi Labs (YC W23) Is Hiring
Corruption erodes social trust more in democracies than in autocracies
Create value for others and don’t worry about the returns
Croatia declared free of landmines after 31 years
Cronboard: A terminal-based dashboard for managing cron jobs
Cross-Model Void Convergence: GPT-5.2 and Claude Opus 4.6 Deterministic Silence
Cursor Composer 2 is just Kimi K2.5 with RL
Customer Update on Simplenote
DARPA's new X-76 Experimental Plane
DHS Contracts Explorer – Hacked data from the Office of Industry Partnership
DOS Memory Management
Daily Driving GrapheneOS
Dan Simmons, author of Hyperion, Song of Kali, dead at 77
Dan Simmons, author of Hyperion, has died
Danish Gov agency to ditch Microsoft software in push for digital independence
Danish government agency to ditch Microsoft software (2025)
Dario Amodei calls OpenAI’s messaging around military deal ‘straight up lies’
Data Has Weight but Only on SSDs
Dataframe 1.0.0.0
Datasets for Reconstructing Visual Perception from Brain Data
Dear Time Lords: Freeze Computers in 1993
Debian decides not to decide on AI-generated contributions
Decimal-Java is a library to convert java.math.BigDecimal to and from IEEE-754r
Decision trees – the unreasonable power of nested decision rules
Defeat as Method
Delphi 13.1 Released, with ARM64 support
Democracy in 2025: on rising authoritarianism in the United States
Denmark was reportedly preparing for full-scale war with the US over Greenland
Denver dumps Flock, awards contract to Axon
Department of War Designates Anthropic Supply Chain Risk
Devirtualization and Static Polymorphism
Digg is gone again
Digg.com Closing Due to Spam
Discontinuation and reinitiation of dual-labeled GLP-1 receptor agonists
Discord cuts ties with Peter Thiel-backed verification software
Do AI Agents Make Money in 2026? Or Is It Just Mac Minis and Vibes?
Do Not Turn Child Protection into Internet Access Control
Does that use a lot of energy?
Dolphin Progress Release 2603
Don't Make Me Talk to Your Chatbot
Don't become an engineering manager
Don't post generated/AI-edited comments. HN is for conversation between humans.
Don't run OpenClaw on your main machine
Don't trust AI agents
Don't use passkeys for encrypting user data
Dragon Ball Color Correction Process [pdf]
Dream Recorder AI – a portal to your subconscious
Drugwars for the TI-82/83/83 Calculators (2011)
Durdraw – ANSI art editor for Unix-like systems
Dyson settles forced labour suit in landmark UK case
ECS Survivors Parts VII – X
EVi, a Hard-Fork of Vim
Ed Zitron loses his mind annotating an AI doomer macro memo
Effort to prevent government officials from engaging in prediction markets
Elevated Errors in Claude.ai
Elite Overproduction
Emuko: Fast RISC-V emulator written in Rust, boots Linux
Eniac, the First General-Purpose Digital Computer, Turns 80
Entomologists use a particle accelerator to image ants at scale
Entso-E final report on Iberian 2025 blackout
EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages
Ethiopia gets $350M World Bank financing for its digital ID project (2024)
Event Horizon Labs (YC W24) Is Hiring
Everett shuts down Flock camera network after judge rules footage public record
Every layer of review makes you 10x slower
Everything Changes, and Nothing Changes
Evolving descriptive text of mental content from human brain activity
Excel incorrectly assumes that the year 1900 is a leap year
Experts sound alarm after ChatGPT Health fails to recognise medical emergencies
Extending single-minus amplitudes to gravitons
F-Droid Board of Directors nominations 2026
FBI is buying location data to track US citizens, director confirms
FCC chairman threatens TV broadcast licenses over news coverage
FFmpeg 101 (2024)
FFmpeg 8.1
FFmpeg-over-IP – Connect to remote FFmpeg servers
Factory Logic
Fed's Cook says AI triggering big changes, sees possible unemployment rise
Federal Right to Privacy Act – Draft legislation
Federal data breach may be the biggest hack in US history
Felix "fx" Lindner has died
Fentanyl makeover: Core structural redesign could lead to safer pain medications
Firefox 148 Launches with AI Kill Switch Feature and More Enhancements
First MacBook Neo Benchmarks Are In
First Website
First Website (1992)
First-ever in-utero stem cell therapy for fetal spina bifida repair is safe
Five Years of Running a Systems Reading Group at Microsoft
Fixfest is a global gathering of repairers, tinkerers, and activists
Flash-Moe: Running a 397B Parameter Model on a Mac with 48GB RAM
Flightradar24 for Ships
Floci – A free, open-source local AWS emulator
Following 35% growth, solar has passed hydro on US grid
Fontcrafter: Turn Your Handwriting into a Real Font
Footage shows US citizen shot dead by ICE agent in Texas traffic stop
Forget Flags and Scripts: Just Rename the File
FrameBook
FreeBSD 14.4-Release Announcement
Full Disclosure: A Third (and Fourth) Azure Sign-In Log Bypass Found
Fungal Electronics (2021)
Fyn: An uv fork with new features, bug fixes, stripped telemetry
GLiNER2: Unified Schema-Based Information Extraction
GNU Texmacs
GPL upgrades via section 14 proxy delegation
GPT 5.4 Thinking and Pro
GPT-5.4 Thinking System Card
GPT-5.4 Thinking and GPT-5.4 Pro
GPT‑5.3 Instant
GPT‑5.4 Mini and Nano
Game about Data of America
Games with loot boxes to get minimum 16 age rating across Europe
Generating All 32-Bit Primes (Part I)
Generative AI Use and Depressive Symptoms Among US Adults
Get Shit Done: A Meta-Prompting, Context Engineering and Spec-Driven Dev System
Get free Claude max 20x for open-source maintainers
Getting Started in Common Lisp
Ghostling
Ghostmd: Ghostty but for Markdown Notes
Ghostty – Terminal Emulator
GitHub appears to be struggling with measly three nines availability
Go is the best language for agents
GoGoGrandparent (YC S16) is hiring Back end Engineers
Goodbye InnerHTML, Hello SetHTML: Stronger XSS Protection in Firefox 148
Google API Keys Weren't Secrets. But Then Gemini Changed the Rules
Google API keys weren't secrets, but then Gemini changed the rules
Google Engineers Launch "Sashiko" for Agentic AI Code Review of the Linux Kernel
Google Street View in 2026
Google Workers Seek 'Red Lines' on Military A.I., Echoing Anthropic
Google Workspace CLI
Google adds 24-hour wait and mandatory reboot to Android sideloading flow
Google just gave Sundar Pichai a $692M pay package
Government grant-funded research should not be published in for-profit journals
Grace Hopper's Revenge
Grafeo – A fast, lean, embeddable graph database built in Rust
Grandparents are glued to their phones, families are worried [video]
GrapheneOS will remain usable by anyone without requiring personal information
Graphics Programming Resources
Grief and the AI split
GrobPaint: Somewhere Between MS Paint and Paint.net. Multiplatform by Default
Gvisor on Raspbian
HP trialed mandatory 15-minute support call wait times (2025)
Hacking an old Kindle to display bus arrival times
Hammerspoon
Hardening Firefox with Anthropic's Red Team
Have a Fucking Website
Have a fucking website
Having Kids (2019)
Hazardous substances found in all headphones tested by ToxFREE project
He saw an abandoned trailer. Then, uncovered a surveillance network
Helix: A post-modern text editor
Helsinki just went a full year without a single traffic death
Hightouch (YC S19) Is Hiring
HopTab–free,open source macOS app switcher and tiler that replaces Cmd+Tab
Hormuz Minesweeper – Are you tired of winning?
Hostile Volume – A game about adjusting volume with intentionally bad UI
How AI skills are quietly automating my workday
How BYD Got EV Chargers to Work Almost as Fast as Gas Pumps
How I write software with LLMs
How Kernel Anti-Cheats Work
How Lego builds a new Lego set
How We Synchronized Editing for Rec Room's Multiplayer Scripting System
How Will OpenAI Compete?
How do I cancel my ChatGPT subscription?
How the Government Deceived Congress in the Debate over Surveillance Powers (2013)
How the Sriracha guys screwed over their supplier
How to Build Your Own Quantum Computer
How to Not Pay Your Taxes
How to Record and Retrieve Anything You've Ever Had to Look Up Twice
How to install and start using LineageOS on your phone
How to record and retrieve anything you've ever had to look up twice
How to run Qwen 3.5 locally
How to talk to anyone and why you should
How we rebuilt Next.js with AI in one week
How will OpenAI compete?
HuggingFace Agent Skills
Human Rights Watch says drone strikes in Haiti have killed nearly 1,250 people
Hydroph0bia – a fixed SecureBoot bypass for UEFI firmware based on Insyde H2O
Hydroph0bia – fixed SecureBoot bypass for UEFI firmware from Insyde H2O (2025)
Hyperlinks in Terminal Emulators
I Built a Scheme Compiler with AI in 4 Days
I Found 39 Algolia Admin Keys Exposed Across Open Source Documentation Sites
I Pitched a Roller Coaster to Disneyland at Age 10 in 1978
I Ported Coreboot to the ThinkPad X270
I am directing the Department of War to designate Anthropic a Supply-Chain Risk
I am directing the Department of War to designate Anthropic a supply-chain risk
I beg you to follow Crocker's Rules, even if you will be rude to me
I built a demo of what AI chat will look like when it's "free" and ad-supported
I built a pint-sized Macintosh
I built a programming language using Claude Code
I don't know how you get here from "predict the next word."
I found 39 Algolia admin keys exposed across open source documentation sites
I pitched a roller coaster to Disneyland at age 10 in 1978
I ported Linux to the PS5 and turned it into a Steam Machine
I put my whole life into a single database
I resigned from OpenAI
I think WebRTC is better than SSH-ing for connecting to Mac terminal from iPhone
I traced $2B in grants and 45 states' lobbying behind age‑verification bills
I'm helping my dog vibe code games
I'm reluctant to verify my identity or age for any online services
IBM, sonic delay lines, and the history of the 80×24 display
IRS Tactics Against Meta Open a New Front in the Corporate Tax Fight
If AI writes code, should the session be part of the commit?
If you thought the code writing speed was your problem; you have bigger problems
Illinois Introducing Operating System Account Age Bill
Image generation models can think
In 2025, Meta paid an effective federal tax rate of 3.5%
In Memoriam: John W. Addison, my PhD advisor
In Praise of Stupid Questions
Indefinite Book Club Hiatus
Independent Geophysical Forensic Analysis of the Nordstream Pipeline Sabotage
India's top court angry after junior judge cites fake AI-generated orders
Inferring Car Movement Patterns from Passive TPMS Measurements
Inferring car movement patterns from passive TPMS measurements
Innocent woman jailed after being misidentified using AI facial recognition
Intel Foundry boss leaves for Qualcomm
Intel XeSS 3: expanded support for Core Ultra/Core Ultra 2 and Arc A, B series
Intel's make-or-break 18A process node debuts for data center with 288-core Xeon
Intelligence is a commodity. Context is the real AI Moat
Intent-Based Commits
Intuitions for Tranformer Circuits
Iran launched unsuccessful attack on UK's Diego Garcia
Iran war energy shock sparks global push to reduce fossil fuel dependence
Iran war wreaking havoc on shipping and air cargo, could create global delays
Iran's Ayatollah Ali Khamenei is killed in Israeli strike, ending 36-year rule
Iran's attacks on Amazon data centers in UAE, Bahrain signal a new kind of war
Iran-backed hackers claim wiper attack on medtech firm Stryker
Ireland shuts last coal plant, becomes 15th coal-free country in Europe (2025)
Isotopic Evidence for a Cold and Distant Origin of Interstellar Object 3I/Atlas
Israel launches strike against Iran, declares state of emergency across country
It looks like the “JVG algorithm” only wins on tiny numbers
JSLinux Now Supports x86_64
Jails for NetBSD – Kernel Enforced Isolation and Native Resource Control
Jane Street Hit with Terra $40B Insider Trading Suit
January in Servo: preloads, better forms, details styling, and more
Java 26 is here
Java is fast, code might not be
Jazz CRJ9 at New York on Mar 22nd 2026, collision with fire truck on runway
Jeff Bezos Upended the Washington Post
Jemalloc un-abandoned by Meta
Jensen Huang says Nvidia is pulling back from OpenAI and Anthropic
Jepsen: MariaDB Galera Cluster 12.1.2
Jiga (YC W21) Is Hiring
Jimi Hendrix was a systems engineer
John Carmack about open source and anti-AI activists
Jolla on track to ship new phone with Sailfish OS, user-replaceable battery
Jolla phone – a full-stack European alternative
Judge finalizes order for Greenpeace to pay $345M in ND oil pipeline case
Just two days of oatmeal cut bad cholesterol by 10%
Kagi Small Web
Kagi Translate now supports LinkedIn Speak as an output language
Kaizen (YC P25) Hiring Eng, GTM, Cos to Automate BPOs
Kangina
Kansai Airport has never lost a baggage in the 30 years since it opened
Khamenei Dead
Ki Editor - an editor that operates on the AST
Kin: Semantic version control that tracks code as entities, not files
Kyber (YC W23) Is Hiring an Enterprise Account Executive
LLM Writing Tropes.md
LLM=True
LLMs can be exhausting
LLMs can unmask pseudonymous users at scale with surprising accuracy
LLMs work best when the user defines their acceptance criteria first
Labor market impacts of AI: A new measure and early evidence
Language Model Contains Personality Subnetworks
Language Model Teams as Distrbuted Systems
Language model teams as distributed systems
Larry Page has moved to Florida
Latency numbers every programmer should know
Launch HN: Captain (YC W26) – Automated RAG for Files
Launch HN: Cardboard (YC W26) – Agentic video editor
Launch HN: Cekura (YC F24) – Testing and monitoring for voice and chat AI agents
Launch HN: Chamber (YC W26) – An AI Teammate for GPU Infrastructure
Launch HN: Didit (YC W26) – Stripe for Identity Verification
Launch HN: IonRouter (YC W26) – High-throughput, low-cost inference
Launch HN: Kita (YC W26) – Automate credit review in emerging markets
Launch HN: OctaPulse (YC W26) – Robotics and computer vision for fish farming
Launch HN: Palus Finance (YC W26): Better yields on idle cash for startups, SMBs
Launch HN: Prism (YC X25) – Workspace and API to generate and edit videos
Launch HN: RunAnywhere (YC W26) – Faster AI Inference on Apple Silicon
Launch HN: Sentrial (YC W26) – Catch AI Agent Failures Before Your Users Do
Launch HN: Sitefire (YC W26) – Automating actions to improve AI visibility
Launch HN: TeamOut (YC W22) – AI agent for planning company events
Launch HN: Terminal Use (YC W26) – Vercel for filesystem-based agents
Launch HN: Vela (YC W26) – AI for complex scheduling
Launch HN: Voltair (YC W26) – Drone and charging network for power utilities
Launch HN: Voygr (YC W26) – A better maps API for agents and AI apps
Launch an autonomous AI agent with sandboxed execution in 2 lines of code
Launching the Claude Partner Network
Lawmakers say US Military used laser to take down Border Protection drone in TX
Layoffs at Block
Leanstral: Open-source agent for trustworthy coding and formal proof engineering
Learning Creative Coding
Learning athletic humanoid tennis skills from imperfect human motion data
Learnings from paying artists royalties for AI-generated art
Leaving Google has actively improved my life
Lenovo's New ThinkPads Score 10/10 for Repairability
Let's Get Physical
Let's discuss sandbox isolation
LibreOffice Writer now supports Markdown
LibreSprite – open-source pixel art editor
Libxml2 Enterprise Edition (AGPL, from the previous maintainer)
Lil Finder Guy
Lil' Fun Langs' Guts
Linux Applications Programming by Example: The Fundamental APIs (2nd Edition)
Linux Internals: How /proc/self/mem writes to unwritable memory (2021)
Lisp-style C++ template meta programming
LiteLLM (YC W23): Founding Reliability Engineer – $200K-$270K and 0.5-1.0% equity
Little Free Library
Little Free Library Books
Living human brain cells play DOOM on a CL1 [video]
LoGeR – 3D reconstruction from extremely long videos (DeepMind, UC Berkeley)
Looks like it is happening
Lost Doctor Who Episodes Found
Love of corporate bullshit is correlated with bad judgment
MAUI Is Coming to Linux
MCP is dead. Long live the CLI
Mac external displays for designers and developers, part 2 (2022)
MacBook M5 Pro and Qwen3.5 = Local AI Security System
Mac mini will be made at a new facility in Houston
Making Firefox's right-click not suck with about:config
Making MCP cheaper via CLI
Making Wolfram tech available as a foundation tool for LLM systems
Manjaro website off-line again due to lapsed certificate
Many SWE-bench-Passing PRs would not be merged
Measuring progress toward AGI: A cognitive framework
Medical journal says the case reports it has published for 25 years are fiction
Megadev: A Development Kit for the Sega Mega Drive and Mega CD Hardware
Men in their 50s may be aging faster due to toxic 'forever chemicals'
Mercury 2: Fast reasoning LLM powered by diffusion
Mercury 2: The fastest reasoning LLM, powered by diffusion
Mesh over Bluetooth LE, TCP, or Reticulum
Meta Horizon Worlds on Meta Quest is being discontinued
Meta and TikTok let harmful content rise to drove engagement, say whistleblowers
Meta’s AI smart glasses and data privacy concerns
Meta’s renewed commitment to jemalloc
Meticulous (YC S21) is hiring to redefine software dev
Michael Pollan punctures the AI bubble
Microgpt
Microslop Manifesto
Microsoft BitNet: 100B Param 1-Bit model for local CPUs
Microsoft Creative Writer (1993)
Microsoft bans the word "Microslop" on its Discord, then locks the server
Microsoft's 'unhackable' Xbox One has been hacked by 'Bliss'
Migrating the American Express Payment Network, Twice
Migrating to the EU
Militaries are scrambling to create their own Starlink
MinIO Is Dead, Long Live MinIO
Mistral AI Releases Forge
MitID, Denmarks sole digital ID, has been down for over an hour and counting
Monkey Island for Commodore 64 Ground Up
MonoGame: A .NET framework for making cross-platform games
More common mistakes to avoid when creating system architecture diagrams
Most of the US economy is in a recession
Motorola announces a partnership with GrapheneOS Foundation
Mouser: An open source alternative to Logi-Plus mouse software
Mozilla to launch free built-in VPN in upcoming Firefox 149
Mullvad VPN: Banned TV Ad in the Streets of London [video]
Multifactor (YC F25) Is Hiring an Engineering Lead
My Homelab Setup
NASA announces major overhaul of Artemis program amid safety concerns, delays
NASA announces overhaul of Artemis program amid safety concerns, delays
NMAP in the Movies
NRC Issues First Commercial Reactor Construction Approval in 10 Years [pdf]
NRC issues first commercial reactor construction approval in 10 years [pdf]
Nango (YC W23, API Access for Agents and Apps) Is Hiring
Nano Banana 2: Google's latest AI image generation model
NanoGPT Slowrun: Language Modeling with Limited Data, Infinite Compute
Nasdaq's Shame
Ndea (YC W26) is hiring a symbolic RL search guidance lead
Nearby Glasses
Netflix Backs Out of Warner Bros. Bidding, Paramount Set to Win
Networking with agents: Put them in the right conversations with Tailscale
Never Bet Against x86
Never Buy A .online Domain
New AirSnitch attack breaks Wi-Fi encryption in homes, offices, and enterprises
New California law requires age verification for all OS accounts
New York could prohibit chatbot medical, legal, engineering advice
New accounts on HN 10x more likely to use em-dashes
New iron nanomaterial wipes out cancer cells without harming healthy tissue
Nightingale – open-source karaoke app that works with any song on your computer
Nihilistic Violent Extremism
No evidence cannabis helps anxiety, depression, or PTSD
No right to relicense this project
No, it doesn't cost Anthropic $5k per Claude Code user
Nobody Gets Promoted for Simplicity
Node.js needs a virtual file system
Noq: n0's new QUIC implementation in Rust
North Korea's 100k fake IT workers net $500M a year for Kim
Notes on Baking at the South Pole
Notes on Lagrange Interpolating Polynomials
Notes on Writing WASM
Number Research Inc
Number of UK workers on zero-hours contracts hits record high ahead of crackdown
Nvidia Launches Vera CPU, Purpose-Built for Agentic AI
Nvidia NemoClaw
Nvidia PersonaPlex 7B on Apple Silicon: Full-Duplex Speech-to-Speech in Swift
Nvidia backs AI data center startup Nscale as it hits $14.6B valuation
Obsidian Sync now has a headless client
Office.eu launches as Europe's sovereign office platform
Oil Is Near a Price That Hurts the Economy
Oil nears $110 a barrel after gas field strike
Online astroturfing: A problem beyond disinformation
Open Letter to Google on Mandatory Developer Registration for App Distribution
Open Source Endowment – new funding source for open source maintainers
Open source calculator firmware DB48X forbids CA/CO use due to age verification
OpenAI Fires an Employee for Prediction Market Insider Trading
OpenAI agrees with Dept. of War to deploy models in their classified network
OpenAI fires an employee for prediction market insider trading
OpenAI is walking away from expanding its Stargate data center with Oracle
OpenAI reaches deal to deploy AI models on U.S. DoW classified network
OpenAI resets spending expectations, from $1.4T to $600B
OpenAI – How to delete your account
OpenAI's $110B funding round (investments from Amazon, Nvidia, SoftBank)
OpenAI, the US government and Persona built an identity surveillance machine
OpenBSD on SGI: A Rollercoaster Story
OpenClaw Surpasses React to Become the Most-Starred Software Project on GitHub
OpenClaw is a security nightmare dressed up as a daydream
OpenCode – Open source AI coding agent
OpenCode – The open source AI coding agent
OpenSUSE Kalpa
OpenTitan Shipping in Production
Operational issue – Multiple services (UAE)
Optimizing Content for Agents
Oracle is building yesterday's data centers with tomorrow's debt
Oregon school cell phone ban: 'Engaged students, joyful teachers'
Origin of the rule that swap size should be 2x of the physical memory
Osaka: Kansai Airport proud to have never lost single piece of luggage (2024)
OsmAnd's Faster Offline Navigation
OsmAnd's Faster Offline Navigation (2025)
Otters as Bioindicators of Estuarine Health
Our Agreement with the Department of War
Our Computer Using agent just solved CAPTCHA up to Level 6
Our Experience with I-Ready
PA Bench: Evaluating Frontier Models on Multi-Tab PA Tasks
PA bench: Evaluating web agents on real world personal assistant workflows
PC Gamer recommends RSS readers in a 37mb article that just keeps downloading
PC processors entered the Gigahertz era today in the year 2000 with AMD's Athlon
POSSE – Publish on your Own Site, Syndicate Elsewhere
Packaging a Gleam app into a single executable
Palantir extends reach into British state as gets access to sensitive FCA data
Palantir's AI Is Playing a Major Role in Tracking Gaza Aid Deliveries
Palm OS User Interface Guidelines (2003) [pdf]
Palm OS User Interface Guidelines [pdf, 2003]
Parakeet.cpp – Parakeet ASR inference in pure C++ with Metal GPU acceleration
Parallel coding agents with tmux and Markdown specs
Parallels confirms MacBook Neo can run Windows in a virtual machine
Pass-Through of Tariffs: Evidence from European Wine Imports
Passengers who refuse to use headphones can now be kicked off United flights
Paul Brainerd, Founder of Aldus PageMaker, has died
Pentagon chief blocks officers from Ivy League schools and top universities
Pentagon expands oversight of Stars and Stripes, limits content
Pentagon threatens to make Anthropic a pariah
Personal Computer by Perplexity
Philosopher Jürgen Habermas Has Died
Physicists Trace Sun's Magnetic Engine, 200k Kilometers Below Surface
Physicists developing a quantum computer that’s entirely open source
Physics Girl: Super-Kamiokande – Imaging the sun by detecting neutrinos [video]
Pi – A minimal terminal coding harness
Pi – a minimal terminal coding harness
Plasma Bigscreen – 10-foot interface for KDE plasma
Plastic is made from milk and it vanishes in 13 weeks
Please do not use auto-scrolling content on the web and in applications
Please, please, please stop using passkeys for encrypting user data
Poll: Code with AI or Not?
Polymarket gamblers threaten to kill me over Iran missile story
Poor Man's Polaroid
Porn depicting sex between step-relatives set to be banned in the UK
Possible US Government iPhone-Hacking Toolkit in foreign spy and criminal hands
PostmarketOS in 2026-02: generic kernels, bans use of generative AI
Preliminary data from a longitudinal AI impact study
President Trump bans Anthropic from use in government systems
Privacy-preserving age and identity verification via anonymous credentials
Process-Based Concurrency: Why Beam and OTP Keep Being Right
Professional video editing, right in the browser with WebGPU and WASM
Profiling Hacker News users based on their comments
Prompt-caching – auto-injects Anthropic cache breakpoints (90% token savings)
Proton Mail Helped FBI Unmask Anonymous 'Stop Cop City' Protester
Psychology: People who don't maintain many close friends learned independence too early
Push events into a running session with channels
Put the zip code first
Python 3.15's JIT is now back on track
QGIS 4.0
Qt45: A small polymerase ribozyme that can synthesize itself
Quillx is an open standard for disclosing AI involvement in software projects
Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers
RAM kits are now sold with one fake RAM stick alongside a real one
RAM now represents 35 percent of bill of materials for HP PCs
RFC 9849. TLS Encrypted Client Hello
RISC-V Is Sloooow
RX – a new random-access JSON alternative
Rack-Mount Hydroponics
Rack-mount hydroponics
Racket v9.1
Red Dwarf creator Rob Grant has died
Red Hat takes on Docker Desktop with its enterprise Podman Desktop build
Reddit User Uncovers Who Is Behind Meta's $2B Lobbying for Age Verification Tech
Redox OS has adopted a Certificate of Origin policy and a strict no-LLM policy
Reflex (YC W23) Is Hiring Software Engineers – Python
Relax NG is a schema language for XML (2014)
Reliable Software in the LLM Era
Relicensing with AI-Assisted Rewrite
Remotely unlocking an encrypted hard disk
Rendezvous with Rama
Revealed: Face of 75,000-year-old female Neanderthal from cave
Reverse-engineering Viktor and making it Open Source
Reversing memory loss via gut-brain communication
Right-sizes LLM models to your system's RAM, CPU, and GPU
Rob Grant, creator of Red Dwarf, has died
Rob Pike's 5 Rules of Programming
Roboflow (YC S20) Is Hiring a Security Engineer for AI Infra
Robotocore · a Digital Twin of AWS
Runners who churn butter on their runs
Running a One Trillion-Parameter LLM Locally on AMD Ryzen AI Max+ Cluster
Rust Is Just a Tool
Rust is just a tool
Ryugu asteroid samples contain all DNA and RNA building blocks
SBCL Fibers – Lightweight Cooperative Threads
SBCL: A Sanely-Bootstrappable Common Lisp (2008) [pdf]
SIM (YC X25) Is Hiring the Best Engineers in San Francisco
SSH has no Host header
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI
Samsung Galaxy update removes Android recovery menu tools, including sideloading
Sandboxes won't save you from OpenClaw
Sarvam 105B, the first competitive Indian open source LLM
Sashiko: An agentic Linux kernel code review system
Scaling Karpathy's Autoresearch: What Happens When the Agent Gets a GPU Cluster
Scientists discover a surprising way to quiet the anxious mind (2025)
Scrt: A CLI secret manager for developers, sysadmins and DevOps
Sea level much higher than assumed in most coastal hazard assessments
Searching for the Agentic IDE
Seed of Might Color Correction Process (2023) [pdf]
Self-improving software won't produce Skynet
Sem – Semantic version control. Entity-level diffs on top of Git
Senior European journalist suspended over AI-generated quotes
Separating the Wayland compositor and window manager
Setting up OpenClaw on a cloud VM
Shall I implement it? No
Show HN: A context-aware permission guard for Claude Code
Show HN: A real-time strategy game that AI agents can play
Show HN: ANSI-Saver – A macOS Screensaver
Show HN: Agent Swarm – Multi-agent self-learning teams (OSS)
Show HN: Algorithms and Data Structures in TypeScript – Free Book (~400 Pages)
Show HN: Antfly: Distributed, Multimodal Search and Memory and Graphs in Go
Show HN: Atomic – self-hosted, semantically-connected personal knowledge base
Show HN: Audio Toolkit for Agents
Show HN: Aurion OS – A 32-bit GUI operating system written from scratch in C
Show HN: Autoresearch@home
Show HN: Badge that shows how well your codebase fits in an LLM's context window
Show HN: Baltic shadow fleet tracker – live AIS, cable proximity alerts
Show HN: Better Hub – A better GitHub experience
Show HN: Claude-File-Recovery, recover files from your ~/.claude sessions
Show HN: Claude-replay – A video-like player for Claude Code sessions
Show HN: Clocksimulator.com – A minimalist, distraction-free analog clock
Show HN: Context Gateway – Compress agent context before it hits the LLM
Show HN: Context Mode – 315 KB of MCP output becomes 5.4 KB in Claude Code
Show HN: Curiosity – DIY 6" Newtonian Reflector Telescope
Show HN: Decided to play god this morning, so I built an agent civilisation
Show HN: Deff – side-by-side Git diff review in your terminal
Show HN: Django Control Room – All Your Tools Inside the Django Admin
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
Show HN: Emdash – Open-source agentic development environment
Show HN: Eyot, A programming language where the GPU is just another thread
Show HN: Free OpenAI API Access with ChatGPT Account
Show HN: GDSL – 800 line kernel: Lisp subset in 500, C subset in 1300
Show HN: Gapless.js – gapless web audio playback
Show HN: Giggles – A batteries-included React framework for TUIs
Show HN: Global Maritime Chokepoints
Show HN: Govbase – Follow a bill from source text to news bias to social posts
Show HN: GrobPaint: Somewhere Between MS Paint and Paint.net
Show HN: Han – A Korean programming language written in Rust
Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust
Show HN: I Was Here – Draw on street view, others can find your drawings
Show HN: I built 48 lightweight SVG backgrounds you can copy/paste
Show HN: I built a real-time OSINT dashboard pulling 15 live global feeds
Show HN: I built a site where you hire yourself instead of applying for jobs
Show HN: I built a sub-500ms latency voice agent from scratch
Show HN: I built a tool that watches webpages and exposes changes as RSS
Show HN: I ported Tree-sitter to Go
Show HN: I'm 15. I mass published 134K lines to hold AI agents accountable
Show HN: Joonote – A note-taking app on your lock screen and notification panel
Show HN: Klaus – OpenClaw on a VM, batteries included
Show HN: Kula – Lightweight, self-contained Linux server monitoring tool
Show HN: Learn Arabic with spaced repetition and comprehensible input
Show HN: Logira – eBPF runtime auditing for AI agent runs
Show HN: Mcp2cli – One CLI for every API, 96-99% fewer tokens than native MCP
Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3
Show HN: Now I Get It – Translate scientific papers into interactive webpages
Show HN: Omni – Open-source workplace search and chat, built on Postgres
Show HN: OneCLI – Vault for AI Agents in Rust
Show HN: Online OCR Free – Batch OCR UI for Tesseract, Gemini and OpenRouter
Show HN: Open-source playground to red-team AI agents with exploits published
Show HN: OpenClaw-class agents on ESP32 (and the IDE that makes it possible)
Show HN: OpenSwarm – Multi‑Agent Claude CLI Orchestrator for Linear/GitHub
Show HN: Out Plane – A PaaS I built solo from Istanbul in 3 months
Show HN: PHP 8 disable_functions bypass PoC
Show HN: PageAgent, A GUI agent that lives inside your web app
Show HN: Pianoterm – Run shell commands from your Piano. A Linux CLI tool
Show HN: Playing LongTurn FreeCiv with Friends
Show HN: Poppy – a simple app to stay intentional with relationships
Show HN: Reclaim Flowers – A 2D physics-based "Digital Altar" protocol
Show HN: Recursively apply patterns for pathfinding
Show HN: Red Grid Link – peer-to-peer team tracking over Bluetooth, no servers
Show HN: RetroTick – Run classic Windows EXEs in the browser
Show HN: Rev-dep – 20x faster knip.dev alternative built in Go
Show HN: Revise – An AI Editor for Documents
Show HN: Rust-powered document chunker for RAG – 40x faster, O(1) memory
Show HN: SNKV – SQLite's B-tree as a key-value store (C/C++ and Python bindings)
Show HN: SQLite for Rivet Actors – one database per agent, tenant, or document
Show HN: Sgai – Goal-driven multi-agent software dev (GOAL.md → working code)
Show HN: SignalCend – API that resolves conflicting IoT device state in 47ms
Show HN: Signet – Autonomous wildfire tracking from satellite and weather data
Show HN: Simple plugin to get Claude Code to listen to you
Show HN: Skir – like Protocol Buffer but better
Show HN: Sonar – A tiny CLI to see and kill whatever's running on localhost
Show HN: SplatHash – A lightweight alternative to BlurHash and ThumbHash
Show HN: Steerling-8B, a language model that can explain any token it generates
Show HN: Swarm – Program a colony of 200 ants using a custom assembly language
Show HN: Terminal Phone – E2EE Walkie Talkie from the Command Line
Show HN: Terminal-Style Portfolio on the Internet
Show HN: The King Wen Permutation: [52, 10, 2]
Show HN: The Mog Programming Language
Show HN: The Roman Industrial Revolution that could have been (Vol 2)
Show HN: Timber – Ollama for classical ML models, 336x faster than Python
Show HN: Tmux-IDE, OSS agent-first terminal IDE
Show HN: Tomoshibi – A writing app where your words fade by firelight
Show HN: Trackm, a personal finance web app
Show HN: Understudy – Teach a desktop agent by demonstrating a task once
Show HN: VS Code Agent Kanban: Task Management for the AI-Assisted Developer
Show HN: Vanilla JavaScript refinery simulator built to explain job to my kids
Show HN: Vertex.js – A 1kloc SPA Framework
Show HN: We built a terminal-only Bluesky / AT Proto client written in Fortran
Show HN: Web Audio Studio – A Visual Debugger for Web Audio API Graphs
Show HN: Will my flight have Starlink?
Show HN: X86CSS – An x86 CPU emulator written in CSS
Show HN: Xmloxide – an agent-made Rust replacement for libxml2
Show HN: ZSE – Open-source LLM inference engine with 3.9s cold starts
Show HN: enveil – hide your .env secrets from prAIng eyes
Show HN: fftool – A Terminal UI for FFmpeg – Shows Command Before It Runs
Show HN: s@: decentralized social networking over static sites
Show HN: uBlock filter list to blur all Instagram Reels
SigNoz (YC W21, open source Datadog) Is Hiring across roles
Smalltalk's Browser: Unbeatable, yet Not Enough
Smartphone Mkt to Decline 13% in '26, Largest Drop Ever Due to Memory Shortage
Smartphone market forecast to decline this year due to memory shortage
So you want to write an “app” (2025)
Software 3.1? – AI Functions
Some Things Just Take Time
Someone Needs to Go to Jail
Something is afoot in the land of Qwen
Source code of Swedish e-government services has been leaked
South Korean Police Lose Seized Crypto by Posting Password Online
Speculative Speculative Decoding (SSD)
Speed at the cost of quality: Study of use of Cursor AI in open source projects
Spice Data (YC S19) Is Hiring a Product Specialist
Spotify playing ads for paid subscribers
Standardizing source maps
Stanford researchers report first recording of a blue whale's heart rate (2019)
Stardex (YC S21) is hiring customer success engineers
Statement from Dario Amodei on Our Discussions with the Department of War
Statement from Dario Amodei on our discussions with the Department of War
Statement on the comments from Secretary of War Pete Hegseth
Steel Bank Common Lisp
Stop Using Grey Text (2025)
Storing 2 bytes of data in your Logitech mouse
Story of XZ Backdoor [video]
Stripe reportedly makes offer to acquire PayPal
Stripe valued at $159B, 2025 annual letter
Structured AI (YC F25) Is Hiring
Sunsetting Jazzband
Super Micro Shares Plunge 25% After Co-Founder Charged in $2.5B Smuggling Plot
Supertoast tables
Surpassing vLLM with a Generated Inference Stack
Switch to Claude without starting over
Switzerland Built an Alternative to BGP
SynthID
System76 on Age Verification Laws
TADA: Fast, Reliable Speech Generation Through Text-Acoustic Synchronization
TUI Studio – visual terminal UI design tool
Talos: Hardware accelerator for deep convolutional neural networks
TeX Live 2026 is available for download now
Teaching Claude to QA a mobile app
Tech Companies Shouldn't Be Bullied into Doing Surveillance
Tech companies shouldn't be bullied into doing surveillance
Tech employment now significantly worse than the 2008 or 2020 recessions
Technical Excellence Is Not Enough
Techno‑Feudal Elite Are Attempting to Build a Twenty‑First‑Century Fascist State
Tell HN: Apple development certificate server seems down?
Tell HN: I'm 60 years old. Claude Code has re-ignited a passion
Tell HN: YC companies scrape GitHub activity, send spam emails to users
Temporal: A nine-year journey to fix time in JavaScript
Tenth Circuit: 4th Amendment Doesn't Support Broad Search of Protesters' Devices
Tesla registrations crash 17% in Europe as BEV market surges 14%
The 185-Microsecond Type Hint
The 49MB web page
The Accidental Room (2018)
The Appalling Stupidity of Spotify's AI DJ
The Appeal and Reality of Recycling LoRAs with Adaptive Merging
The Banality of Surveillance
The Brand Age
The Day I Discovered Type Design
The Enterprise Context Layer
The Excommunicated Devs Making Games with AI
The Future of AI
The Future of Version Control
The Government Uses Targeted Advertising to Track Your Location
The Hunt for Dark Breakfast – Can we derive breakfasts we have never observed?
The Hunt for Dark Breakfast
The Impact of AI on Game Dev Jobs. Open to Work Crisis
The L in "LLM" Stands for Lying
The Life Cycle of Money
The Linux Programming Interface as a university course text
The Millisecond That Could Change Cancer Treatment
The Misuses of the University
The Most-Seen UI on the Internet? Redesigning Turnstile and Challenge Pages
The Om Programming Language
The Pentagon Feuding with an AI Company Is a Bad Sign
The Pentagon Threatens Anthropic
The Pentagon is making a mistake by threatening Anthropic
The Reason Windows Hate Is Exploding: It's the End of Personal Computing [video]
The Robotic Dexterity Deadlock
The Science of Detecting LLM-Generated Text
The Science of Detecting LLM-Generated Text (2024)
The Shady World of IP Leasing
The Slow Death of the Power User
The Soul of a Pedicab Driver
The Three Pillars of JavaScript Bloat
The United States needs fewer bus stops
The View from RSS
The Webpage Has Instructions. The Agent Has Your Credentials
The Windows 95 User Interface: A Case Study in Usability Engineering
The Windows 95 user interface: A case study in usability engineering (1996)
The Wyden Siren Goes Off Again: We'll Be "Stunned" by NSA Under Section 702
The Xkcd thing, now interactive
The beauty and terror of modding Windows
The biggest theft in human history occurred in broad daylight
The changing goalposts of AGI and timelines
The complete Manic Miner disassembly
The death of social media is the renaissance of RSS (2025)
The first airplane fatality
The gold standard of optimization: A look under the hood of RollerCoaster Tycoon
The inner workings of TCP zero-copy
The next generations of Bubble Tea, Lip Gloss, and Bubbles are available now
The normalization of corruption in organizations (2003) [pdf]
The pleasures of poor product design
The return-to-the-office trend backfires
The stagnancy of publishing and the disappearance of the midlist
The strait of Hormuz blockade will strangle US defense industry
The surprising whimsy of the Time Zone Database
The war against PDFs is heating up
The whole thing was a scam
The workers behind Meta's smart glasses can see everything
The workers behind Meta’s smart glasses can see everything
The worst acquisition in history, again
The yoghurt delivery women combatting loneliness in Japan
The “JVG algorithm” only wins on tiny numbers
The “small web” is bigger than you might think
Theory of Constraints: "Blue Light" creating capacity for nothing (2007)
Thermal Grizzly was scammed twice on raw materials worth €40k
They're Vibe-Coding Spam Now
Thinking Fast, Slow, and Artificial: How AI Is Reshaping Human Reasoning
This System Can Go Fuck Itself and Burn in Hell
Throwing away 18 months of code and starting over
Time Is Different
Timeline: Anthropic, OpenAI, and U.S. Government
Tinnitus Is Connected to Sleep
Tinybox – Offline AI device 120B parameters
Tom Homan confirms ICE to be at airports starting Monday
Treasure hunter freed from jail after refusing to turn over shipwreck gold
Tree Search Distillation for Language Models Using PPO
Trellis AI (YC W24) is hiring deployment lead to accelerate medication access
Turing Completeness of GNU Find: From Mkdir-Assisted Loops to Standalone Comput
Turing Completeness of GNU find
Turns out Generative AI was a scam
Twitch: "Hey, come back! This commercial break can't play while you're away."
Two Years of Emacs Solo
Two Years of Emacs Solo: 35 Modules, Zero External Packages, and a Full Refactor
Two insider cases we've recently closed
U+237C ⍼ Is Azimuth
U.S. and Israel Conduct Strikes on Iran
U.S. science agency moves to restrict foreign scientists from its labs
UK's Ofcom has today fined 4chan £450k for not having age checks in place
UMD Scientists Create 'Smart Underwear' to Measure Human Flatulence
US Court of Appeals: TOS may be updated by email, use can imply consent [pdf]
US Court of Appeals: TOS may be updated by email, use may imply consent [pdf]
US Military leaders meet with Anthropic to argue against Claude safeguards
US SEC preparing to scrap quarterly reporting requirement
US and Israel launch strikes on Iran, as Trump says ‘massive’ campaign underway
US commercial insurers pay 254% of Medicare for the same hospital procedures
US economy sheds 92,000 jobs in February in sharp slide
US orders diplomats to fight data sovereignty initiatives
US private credit defaults hit record 9.2% in 2025, Fitch says

EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages

Evaluating Genuine Reasoning in LLMs via Esoteric Languages 🤖
