Daily Digest — 2026-05-22

27 newsletters today.

In this digest

Abandoned Futures — The Convair X-11 and X-12 (SM-65 Atlas Testbeds) Aside — Today's Topic: The Lockheed CL-1201: The Nuclear-Powered Flying Aircraft Carrier Lockheed Actually Designed in 1969
ArXiv Paper Digest — DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback
Daily Automotive Engines — Oil Galleries and Pressurized Lubrication Routing
Daily Debugging Puzzle — Java's Arrays.asList(int[]) Trap: The List of One That Looks Like Four
Daily Digital Circuits — Static Timing Analysis Corners: Why Hardware Engineers Sign Off Chips Across Multiple Universes
Daily Electrical Circuits — Triangle Wave Generators: The Integrator-Comparator Loop
Daily Engineering Lesson — PWM (Pulse Width Modulation): How Digital Signals Control Analog Power
Forgotten Books — The 1935 Prescription Modern Parents Keep Forgetting: Let Children Be Bored
Forgotten Darkroom — The Lost Art of Bathing Photographic Plates in Dye to See New Colors
Forgotten Patent — Guglielmo Marconi's "Tuned Circuits" Patent: The 1900 Filing That Invented Wireless Channels — and Made the Radio Spectrum Usable
Daily GitHub Zero Stars — Engine-Room-Games/ERG-Generators
Daily Hardware Architecture — The Commit Buffer and Architectural State Update: How CPUs Make Speculation Permanent
Hacker News Deep Cuts — Phantasy Star IV – 1993 Developer Interviews
HN Jobs Teardown — Vocdoni: What Their Hiring Reveals
Daily Low-Level Programming — The Line Fill Buffer: How the CPU Tracks Cache Misses In Flight
Reddit Small Subs — Dividing via multiplicative inverse on RISC-V
RFC Deep Dive — RFC 3117: On the Design of Application Protocols
Stack Overflow Unanswered — Sync reset used as async reset
Daily Software Engineering — The Leader Election Pattern: Picking One Node to Do the Singleton Work
Tool Nobody Knows — parallel: GNU parallel's Tricks That Aren't Just xargs -P
What If Engineering — What If We Built a Mass Driver Up the Side of a Mountain to Launch Satellites?
Wikipedia Rabbit Hole — Rhenium
Daily YT Documentary — Rob Beckett talks about traditional blacksmiths | Traditional Ways | Short Documentary
Daily YT Electronics — Water Tank Alert system using RTOS
Daily YT Engineering — Crystal Defects Explained: Vacancies, Alloying, Solid Solutions & Materials Science
Daily YT Maker — How to Build a DIY Manual Rebar Bending Jig
Daily YT Welding — ArcDroid Plus Power Upgrade Install | Easy Fix for Plus Controller Power Issues

Abandoned Futures

The Convair X-11 and X-12 (SM-65 Atlas Testbeds) Aside — Today's Topic: The Lockheed CL-1201: The Nuclear-Powered Flying Aircraft Carrier Lockheed Actually Designed in 1969

2026-05-22

In 1969, while NASA was landing on the Moon, a team at Lockheed's Georgia division was finishing detailed engineering studies on something even more outrageous: a nuclear-powered aircraft that was bigger than the RMS Titanic. The CL-1201-1-1, as the design was formally designated, had a wingspan of 1,120 feet, a length of 560 feet, a gross weight of 5,265 tons, and was intended to stay airborne for 41 days at a stretch on a single reactor fueling. It was, quite literally, a flying aircraft carrier.

The numbers are not typos. Lockheed engineer R.L. Lichten and his team produced full three-view drawings, weight breakdowns, and mission profiles. Power came from a 1,830-megawatt thermal nuclear reactor mounted amidships, feeding four shielded turbojets and twelve lift jets for slow-speed maneuvering. Cruise speed was Mach 0.8 at 30,000 feet. The airframe carried 22 fighter aircraft (F-4 Phantoms in the original spec, later F-15s) docked beneath the wing on retractable trapezes, plus a 845-troop airborne assault complement. A variant, the CL-1201-2-1, was a pure logistics platform that could lift 1,000 tons of cargo.

Why it died: Three reasons, none of them technical impossibility.

Shielding mass. The crew compartment shielding alone was 393 tons of lead and polyethylene. The reactor compartment shielding was another 553 tons. This was based on 1969 materials science — borated graphite, lead, and water — and it ate the payload margin.
Nuclear airframes were already politically dead. Eisenhower had killed the ANP nuclear bomber program in 1961. Project Pluto's nuclear ramjet ended in 1964. By 1969 nobody in Congress would fund another flying reactor, no matter how clever the engineering.
Cost. Lockheed's own estimate was $4 billion per airframe in 1969 dollars — roughly $34 billion today — and the Air Force wanted 22 of them.

Why 2026 changes the math:

Modern reactor compactness. The kind of high-temperature gas-cooled microreactor that companies like X-energy and BWXT are now licensing for terrestrial use produces 50–100 MW from a unit the size of a shipping container, with passively safe TRISO fuel. A flying platform doesn't need 1,830 MW anymore — modern high-bypass turbofans are roughly 3× more efficient than 1969 turbojets.
Shielding revolution. Boron-loaded hydrogen-rich polymers, tungsten-impregnated composites, and additively manufactured tungsten-carbide lattices cut shield mass by an estimated 60% compared to 1969 lead-and-poly stacks. The actual breakthrough is graded-Z shielding printed in single-piece geometry — impossible to manufacture in 1969.
Autonomous docking. The original design needed human pilots to dock F-4s to a moving wing. Today, autonomous aerial refueling has been demonstrated (MQ-25 Stingray, 2021), and station-keeping under a 560-foot wing is a tractable controls problem.
The mission came back. Persistent airborne ISR platforms, contested-logistics resupply, and Pacific-theater dispersal of tactical aircraft are exactly the problems the CL-1201 was designed to solve. The Air Force's 2024 "Agile Combat Employment" doctrine is, structurally, what Lichten was proposing in 1969.

You wouldn't build the 5,265-ton version. You'd build a 1,500-ton CL-1201 with a 100-MW microreactor, autonomous loitering, and a half-dozen drone hangars. The bones of the design are sound — Lockheed did the structural math correctly, and that math is in declassified report LR 21551.

Key Takeaway: The CL-1201 wasn't science fiction — it was a fully engineered 1969 design killed by 1969 shielding mass and 1969 politics, and both constraints have collapsed in the half-century since.

ArXiv Paper Digest

DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback

2026-05-22

Authors: Yunpeng Dong, Jingkai He, Yuze Hou, Dong Du

ArXiv: 2605.22781v1

PDF: Download PDF

Imagine you're playing a video game and want to try a risky move. You'd hit "save" first, attempt the move, and if it goes badly, you reload. Now imagine doing that thousands of times per minute. That's essentially what modern AI agents need to do when they explore possible solutions to a problem — they branch out, try things, fail, roll back, and try again.

The problem: today's "save and reload" for AI agent sandboxes is slow. Each checkpoint copies the entire state of the sandbox — every file, every chunk of memory, every running process. This takes hundreds of milliseconds to several seconds. When an agent needs to explore a tree of possibilities or do reinforcement learning with massive fan-out, that latency becomes a brick wall.

What DeltaBox does: The authors noticed something simple but powerful — consecutive checkpoints in an AI agent's workflow are almost identical. The agent edits one file, runs one command, changes one variable. Why copy the whole world when only a tiny slice changed?

DeltaBox saves only the "delta" — the difference between the current state and the previous checkpoint. Think of it like Git for live processes: instead of duplicating the entire repository every commit, you just record what changed. This includes:

File system deltas: only modified files get tracked
Process memory deltas: only the memory pages that were written to since last checkpoint
Execution context deltas: the small bits of process state (registers, open file handles) that actually shifted

The result is checkpoint and rollback operations that complete in milliseconds rather than seconds — often a 100x+ speedup. That changes the economics of agent exploration completely. Techniques like Monte Carlo tree search, parallel speculative execution, and large-scale RL training that were previously bottlenecked by C/R overhead suddenly become viable.

The key insight: Treating sandbox state as immutable snapshots was a reasonable default borrowed from VM and container worlds, where checkpoints are rare. But AI agents have flipped the workload — checkpoints are now the dominant operation, not the exception. Designing the data structures around incremental change rather than full state matches reality.

This is one of those papers where the trick is "obvious in hindsight" but only obvious because someone carefully measured where the time was actually going. The practical impact is that the next generation of agentic systems doing tree search or RL won't have to pick between "deep exploration" and "reasonable wall-clock time."

Why it matters: By making sandbox checkpoint/rollback 100x faster, DeltaBox removes a fundamental bottleneck that was limiting how deeply AI agents could explore solution spaces during search and reinforcement learning.

Daily Automotive Engines

Oil Galleries and Pressurized Lubrication Routing

2026-05-22

Oil galleries are the drilled passages inside the engine block and head that carry pressurized oil from the pump to every bearing surface. They're invisible from the outside, but their layout determines whether your engine survives 200,000 miles or spins a bearing at 80,000.

After oil leaves the pump and passes through the filter, it enters the main gallery — typically a single long passage running parallel to the crankshaft, drilled through the block from end to end. From there, cross-drilled passages branch off to feed:

Main bearings (highest priority — fed directly off the main gallery)
Rod bearings (fed through drilled passages in the crankshaft itself)
Cam bearings (via vertical risers to the head or cam tunnel)
Lifters and rocker shafts (last in line, lowest pressure)
Piston squirters (often gated by a pressure-relief check valve)

The drilling order matters. Mains get oil first because they carry the heaviest load. Lifters and rockers get whatever pressure remains — which is why hydraulic lifters tick on cold starts when the gallery is still filling.

Real-world example: The Chevy LS engine has a notorious quirk — its priority valve sends oil to the lifters before the mains under certain conditions. When the AFM (cylinder deactivation) lifters fail and dump pressure, the mains starve. This is why LS engines with collapsed AFM lifters often eat a rod bearing as the secondary failure. The gallery layout dictated the failure mode.

Plugs and cleanliness: Galleries are drilled straight through the block, then sealed at the ends with cup plugs or pipe plugs. During a rebuild, every plug must come out so the galleries can be brushed clean — machining chips, old varnish, and sludge hide in there. A single overlooked plug holding back debris will starve a bearing within minutes of fire-up.

Rule of thumb: Oil pressure should be roughly 10 psi per 1,000 rpm at operating temperature, with a minimum of 10 psi at hot idle. A healthy small-block at 3,000 rpm wants ~30 psi; a high-revving sport bike engine may run 60+ psi at redline because the galleries are longer and the bearings are smaller (higher restriction).

Modern engines add variable-displacement oil pumps that modulate gallery pressure based on demand — reducing parasitic loss at cruise while still flooding the bearings at WOT.

See it in action: Check out How the oil system works on a LS and what

#39;s the

quot;barbell

quot; do anyway? by Powell Machine-Powell Cams to see this theory applied.

Key Takeaway: Oil galleries are a prioritized plumbing network — mains first, accessories last — and their drilling order dictates which bearing dies first when pressure drops.

Daily Debugging Puzzle

Java's `Arrays.asList(int[])` Trap: The List of One That Looks Like Four

2026-05-22

Spot the bug. This class enforces a whitelist of allowed network ports. The data is right there in the array — why does every legitimate port get rejected?

import java.util.Arrays;

public class PortChecker {
    private static final int[] ALLOWED = {22, 80, 443, 8080};

    public static boolean isAllowed(int port) {
        return Arrays.asList(ALLOWED).contains(port);
    }

    public static int allowedCount() {
        return Arrays.asList(ALLOWED).size();
    }

    public static void main(String[] args) {
        System.out.println(isAllowed(80));    // expected: true   actual: false
        System.out.println(isAllowed(443));   // expected: true   actual: false
        System.out.println(allowedCount());   // expected: 4      actual: 1
    }
}

The Bug

The signature of Arrays.asList is <T> List<T> asList(T... a). Because T must be a reference type, Java cannot bind it to int. So when you pass an int[], the compiler does not unpack it into individual elements. Instead it infers T = int[] and treats the whole array as a single varargs element.

The result: Arrays.asList(ALLOWED) returns a List<int[]> of size 1, whose sole element is the array itself. Then:

list.contains(80) autoboxes 80 to Integer and looks for it in a list containing one int[]. Integer.equals(int[]) is always false.
list.size() returns 1, not 4.

And here's the truly nasty part: it compiles without a warning. If you had written List<Integer> allowed = Arrays.asList(ALLOWED); the type checker would have caught it. But because contains takes Object and size returns an int, the chained one-liner is perfectly well-typed. Change the array's element type to Integer[] and the bug vanishes — which is why this often slips through in code that was refactored from Integer[] to int[] for "performance."

Static analyzers flag it (IntelliJ shows "Confusing call to Arrays.asList"), but only if you've enabled the inspection.

The Fix

Use a stream and stay in primitive land — no boxing required:

import java.util.Arrays;

public class PortChecker {
    private static final int[] ALLOWED = {22, 80, 443, 8080};

    public static boolean isAllowed(int port) {
        return Arrays.stream(ALLOWED).anyMatch(p -> p == port);
    }

    public static int allowedCount() {
        return ALLOWED.length;
    }
}

If you genuinely need a List<Integer>, box explicitly:

List<Integer> allowed = Arrays.stream(ALLOWED).boxed().toList();

The same trap waits in List.of, Stream.of, Collections.addAll, and any other varargs method called with a primitive array. long[], double[], char[], boolean[] — all of them get wrapped as a single element. Only object arrays (String[], Integer[], etc.) get unpacked the way you'd expect, because then T can bind to the element type.

The deeper lesson: Java's generics and varargs were bolted on after primitives were a fundamental part of the language, and the boundary between them is sharp and silent. When a generic method meets a primitive array, the array becomes one big opaque object — and no amount of code review will see it, because the code looks correct.

Key Takeaway: Arrays.asList(someIntArray) returns a one-element List<int[]>, not a list of integers — because Java generics can't bind a type parameter to a primitive, so the whole array is captured as the single varargs argument.

Daily Digital Circuits

Static Timing Analysis Corners: Why Hardware Engineers Sign Off Chips Across Multiple Universes

2026-05-22

When software ships, it runs on the silicon you bought. When silicon ships, it runs on every silicon that came off the wafer — and each die is slightly different. Static Timing Analysis (STA) corners are how hardware engineers prove the chip works across the full range of manufacturing variation, voltage, and temperature — without simulating a single test vector.

The three axes of variation (PVT):

Process (P): Transistors come out faster or slower than nominal depending on doping, lithography, and etch. Labeled SS (slow-slow), TT (typical), FF (fast-fast), plus skewed corners SF and FS where NMOS and PMOS go opposite directions.
Voltage (V): Vdd droops under load and rises when idle. A 1.0V nominal rail might be specified at 0.9V–1.1V.
Temperature (T): Junction temp swings from −40°C (automotive cold start) to +125°C (datacenter hotspot). Counterintuitively, at low Vdd, cold transistors can be slower than hot ones — "temperature inversion."

The two failures you check at opposite corners:

Setup violations happen when logic is too slow. Check at the slow corner: SS process, low Vdd, high temp (or cold, if inverted). Worst-case data arrival vs. earliest clock capture.
Hold violations happen when logic is too fast — data races through and corrupts the next flop before the clock edge captures the old value. Check at the fast corner: FF process, high Vdd, low temp.

Real example: A 7nm SoC at TSMC typically signs off at ~9 corners: {SS, TT, FF, SF, FS} × {min Vdd, max Vdd} × {−40°C, 0°C, 125°C}, often pruned to the dominant ones per analysis type. Multiply by RC corners (Cmax/Cmin/RCmax/RCmin for interconnect variation) and you get 30+ corner combinations. Each one is a full STA run on hundreds of millions of timing paths.

Rule of thumb — derate the clock: If your nominal cycle is 1.0 ns, plan for usable timing budget around 0.7–0.8 ns after subtracting clock uncertainty (jitter + skew, ~5–10% of period), OCV (on-chip variation) derate (~3–5%), and margin for crosstalk and aging (~5%). The "1 GHz chip" is really a "1.25 GHz design clocked at 1 GHz so it still works on the worst die at 125°C three years from now."

Why this matters: A path that passes at TT by 50 ps may fail SS by 80 ps. Signing off only at typical is how you get a chip that works on the bench and dies in the field.

See it in action: Check out Simple Math behind GPS 🛰️ by Cuemath to see this theory applied.

Key Takeaway: Hardware must meet timing simultaneously across every combination of process, voltage, and temperature corner — slow corners catch setup failures and fast corners catch hold failures, and missing any one means silicon that works in the lab but fails in customers' hands.

Daily Electrical Circuits

Triangle Wave Generators: The Integrator-Comparator Loop

2026-05-22

You can build a beautiful, rock-steady triangle wave generator from just two op-amps, two resistors, and a capacitor — no crystal, no exotic ICs. The circuit pairs an integrator with a Schmitt-trigger comparator in a positive-feedback loop. The comparator output (a square wave) feeds the integrator's input; the integrator's output (a triangle wave) feeds the comparator's threshold input. Each block forces the other to flip at precise voltage limits, producing two synchronized waveforms: square out of one op-amp, triangle out of the other. This is the classic function generator core — exactly what's inside the venerable ICL8038 and XR-2206 chips.

How it works: The comparator output sits at either +Vsat or −Vsat. The integrator sees a constant input, so its output ramps linearly (V = −(Vin/RC)·t). When the triangle crosses the comparator's upper threshold, the square flips negative, and the integrator ramps the other way. Linear ramps in, sharp transitions at the corners — a near-perfect triangle.

Concrete example: Build a 1 kHz triangle generator with ±5 V peaks using a TL072 dual op-amp on ±12 V rails. Use a non-inverting Schmitt comparator with R1 = 10 kΩ from the triangle output to the +input, and R2 = 10 kΩ from the square output (call it ±10 V after saturation losses) to the same +input. The thresholds are ±Vsat·(R1/R2) = ±5 V — exactly what you want. For the integrator, pick C = 10 nF. The frequency formula is:

f = R2 / (4 · R1 · R · C)

Solving for R (integrator resistor) at f = 1 kHz: R = R2/(4·R1·C·f) = 10k/(4·10k·10nF·1k) = 25 kΩ. Use a 22 kΩ fixed resistor in series with a 10 kΩ pot for fine tuning.

Practical gotchas:

Slew rate matters. The comparator's square-wave edges must be faster than your triangle frequency demands. A TL072 (13 V/μs) handles audio fine; above ~50 kHz, switch to an LM319 comparator driving a separate integrator op-amp.
Asymmetric ramps? Op-amp saturation isn't symmetric — +Vsat ≠ |−Vsat|. Use a rail-to-rail output op-amp, or clamp the square wave with back-to-back Zeners for clean symmetric drive.
DC offset drift. The integrator has infinite DC gain, so any input offset voltage drifts the triangle's average. Add a 1 MΩ resistor across the integrator capacitor to bleed off DC — it slightly degrades linearity but kills the drift.
Voltage control. Replace the integrator resistor with a JFET or OTA, and you have a VCO suitable for analog synthesizers.

Rule of thumb: Frequency is inversely proportional to integrator RC. Halve the capacitor, double the frequency. The amplitude depends only on the comparator resistor ratio — change R1/R2 to tune peak voltage without touching frequency.

See it in action: Check out Triangular Wave Generator using OpAmp (Basics, Circuit, Working

amp; Waveforms) Explained by Engineering Funda to see this theory applied.

Key Takeaway: A triangle wave generator is just an integrator and a Schmitt comparator locked in mutual feedback — each forces the other to reverse at precise thresholds, yielding linear ramps and sharp corners for free.

Daily Engineering Lesson

PWM (Pulse Width Modulation): How Digital Signals Control Analog Power

2026-05-22

PWM is how microcontrollers fake an analog output without a DAC. You switch a digital pin on and off very fast, and the ratio of on-time to total period (the duty cycle) determines the average power delivered. A motor, LED, or heater sees the average — not the rapid switching — as long as the frequency is high enough.

The core math: Average voltage = Duty cycle × V_supply. A 12V supply at 25% duty cycle delivers an average of 3V. The duty cycle ranges from 0 (always off) to 1.0 (always on), typically expressed as 0–255 in 8-bit microcontrollers or 0–65535 in 16-bit timers.

Why it works: Inductive loads (motors, solenoids) and thermal loads (heaters, incandescent bulbs) have time constants much longer than the PWM period. The current can't change instantly through an inductor, and a filament can't cool between pulses. They effectively integrate the waveform. For LEDs, the human eye does the integration — anything above ~200 Hz looks steady.

Picking the frequency:

DC motors: 15–25 kHz (above audible — lower frequencies make motors "sing")
LEDs: 200 Hz minimum, 1 kHz+ for camera-friendly
Heaters: 1 Hz or less is fine — thermal mass smooths everything
Servos (RC type): Fixed 50 Hz, 1–2 ms pulse width encodes position (this is PWM as a signaling protocol, not power delivery)

Real-world example: A 3D printer hotend running on 24V at 40W. You don't want to switch 40W on and off mechanically. The firmware reads the thermistor, runs PID, and outputs a PWM duty cycle to a MOSFET. At 50% duty, the heater dissipates 20W on average. At 100%, it pulls full power for fast heat-up. The thermal mass of the heater block makes the switching frequency irrelevant — even 1 Hz works.

The catch — switching losses: The MOSFET dissipates power during the transitions between on and off. Faster switching means more transitions per second and more heat in the FET. This is why motor drivers use gate driver chips: they slam the MOSFET gate from off to on in nanoseconds, minimizing the time spent in the resistive middle region.

Rule of thumb: If you can hear your motor whining at a specific pitch, your PWM frequency is too low — push it above 20 kHz. If your MOSFET runs hot at high duty cycles, the issue is conduction loss (R_DS(on)); if it runs hot at 50% duty, it's switching loss.

See it in action: Check out Pulse Width Modulation (PWM) - Electronics Basics 23 by Simply Electronics to see this theory applied.

Key Takeaway: PWM controls analog power with digital signals by varying duty cycle, relying on the load's inertia (thermal, inductive, or visual) to average out the switching.

Forgotten Books

The 1935 Prescription Modern Parents Keep Forgetting: Let Children Be Bored

2026-05-22

Book: The Wainwright star (1935-12-11) by Unknown (1935)

Read it: Internet Archive

Buried in a small-town Alberta newspaper, in a syndicated health column from the Canadian Medical Association, sits a piece of parenting advice that reads like it was written yesterday by a child psychologist warning about over-scheduled kids. The column, edited by Dr. Grant Fleming, Associate Secretary of the CMA, takes direct aim at what he calls the "extras" — the music lessons, dancing classes, and enrichment activities that wealthier families layer onto childhood.

Fleming's argument is almost startlingly modern:

"We think of the children whose parents are economically able to give them opportunities to study music, dancing, et cetera, as being the lucky ones. They may be, but sometimes the 'extras' are anything but good for them... it is even more desirable that the child have sufficient time for play and an abundance of rest, together with ample opportunity to do the things which he wants to do."

He goes further, drawing a direct line between under-rested children and what he calls a "whole train of undesirable physical and mental conditions":

"Children require sufficient rest, and yet more children are deprived of this essential than suffer from other physical needs. Lack of rest leads to malnutrition, irritability and a whole train of undesirable physical and mental conditions. Play is just as necessary for the child as is food. Play implies doing what the child wants to do, not what someone else considers he should do."

And then the sharpest line — a critique of well-meaning, over-controlling parents that could have come from any contemporary book on childhood development:

"Parents with the best of intentions set out to plan the lives of their children. They may feel that they want to protect them against the difficulties which they themselves had to face. They have forgotten, or else they never knew, that if their child is to be a healthy happy adult, he must grow up in the sense of becoming independent, able to stand on his own feet, and to face the difficulties of life as they come along."

Was he ahead of his time? Remarkably so. Modern developmental research has converged exactly on Fleming's intuition. Peter Gray's work on the decline of free play, the American Academy of Pediatrics' 2018 clinical report declaring play "essential to development," and Jonathan Haidt's 2024 book The Anxious Generation all make the same argument Fleming made in a Wainwright newspaper in December 1935: that self-directed, unsupervised, unstructured play is not a luxury — it is a developmental nutrient, and depriving children of it produces measurable harm.

What's striking is the economic framing. Fleming was writing in the depths of the Depression, and he specifically warns that the children of affluent parents may be the disadvantaged ones — because affluence buys "extras" that crowd out the rest and play that are genuinely essential. The modern equivalent — the resume-padded suburban kid shuttled between travel soccer and Kumon — would have been instantly recognizable to him.

The forgotten claim: In 1935, the Canadian Medical Association was already warning that scheduled enrichment activities harm children by displacing the rest and self-directed play they actually need to become independent adults.

Forgotten Darkroom

The Lost Art of Bathing Photographic Plates in Dye to See New Colors

2026-05-22

Book: Studies in Color Sensitive Photographic Plates and Methods of Sensitizing by Bathing by Francis M. Walters, Jr. and Raymond Davis (1921)

Read it: Internet Archive

In 1921, the U.S. Bureau of Standards published a quiet little Scientific Paper (No. 422, price 15 cents) describing a technique that would astonish almost any modern photographer: you could take an ordinary, garden-variety blue-sensitive photographic plate, soak it in a beaker of pink or green dye, dry it in the dark, and pull out a plate that suddenly "saw" colors it had been blind to moments before.

The book itself is a working chemist's manual. Walters and Davis — an associate physicist and a photographic technologist at the Bureau — were trying to give working photographers and scientists a way to extend the spectral range of cheap commercial plates without buying expensive panchromatic stock. Their key figure shows it plainly:

"Spectrograms in the upper left-hand corner show the regions of color sensitiveness of the three types of commercial dry plates. The other spectrograms show the sensitiveness conferred to ordinary blue sensitive plates (Seed, 26X in this case) by bathing in solutions of sensitizing dyes. The name of the dye used is given under each spectrogram."

The roll call of dyes reads like a forgotten apothecary: Pinaverdol, Cyanin, Homocol, Erythrosin, Pinacyanol, Rose Bengal, Dicyanin. Each one extends the plate's sensitivity to a different band — green, yellow, orange, deep red. Erythrosin (a food-dye cousin of the red dye still used today in maraschino cherries) gave you the green. Pinacyanol pushed you into the red. Dicyanin, a notoriously unstable cyanine dye, was the one that finally let astronomers and spectroscopists photograph the infrared.

Is the technique real? Entirely. This is dye sensitization, discovered by Hermann Vogel in 1873, and it is, almost unbelievably, the same mechanism that makes every color photograph, every digital camera sensor's color filter array, and every dye-sensitized solar cell work. Silver halide crystals only absorb blue and ultraviolet light. The dye molecule sits on the crystal surface, absorbs a photon of the "wrong" color, and injects an excited electron into the silver halide — exactly the trick Michael Grätzel rediscovered in 1991 for solar cells and won fame for.

What's been forgotten isn't the physics. It's the kitchen-sink accessibility. In 1921, a high-school chemistry teacher with a darkroom, a tray, and a 50-cent vial of erythrosin could custom-tune the color response of any plate they bought. The book gives the exact concentrations, bathing times, and drying procedures. Today the same capability is locked inside a fab-grown CMOS sensor with a Bayer filter you cannot modify.

Modern echo: dye-sensitized solar cells (Grätzel cells), CMOS color filter arrays, fluorescent labels in biology — all are direct intellectual descendants of these dye baths.
Lost capability: the ability of an amateur to tune the spectral sensitivity of their own imaging medium with a tray and a bottle.

The forgotten claim: Any blue-sensitive photographic plate could be transformed into a panchromatic or infrared-sensitive plate at home by simply bathing it in the right cyanine or xanthene dye — the same molecular trick that now runs solar cells and digital camera color filters.

Forgotten Patent

Guglielmo Marconi's "Tuned Circuits" Patent: The 1900 Filing That Invented Wireless Channels — and Made the Radio Spectrum Usable

2026-05-22

In the late 1890s, wireless telegraphy worked — barely. Heinrich Hertz had proven electromagnetic waves existed, and Marconi was bouncing crude spark-gap signals across the English Channel. But there was a fatal flaw: every transmitter screamed across the entire spectrum at once. Two stations operating in the same area drowned each other out. Wireless was a single shared shouting match, not a communication medium.

On April 26, 1900, Guglielmo Marconi filed British Patent No. 7777 — famously nicknamed the "four sevens" patent — titled "Improvements in Apparatus for Wireless Telegraphy." The US equivalent followed as US 763,772, granted June 28, 1904. What Marconi described was deceptively simple: matching resonant tuned circuits at both the transmitter and receiver, using inductor-capacitor pairs synchronized to the same frequency. A station tuned to one frequency would ignore signals at other frequencies. Suddenly, multiple wireless conversations could share the air without colliding.

The mechanism was elegant. Each station had a primary oscillating circuit coupled to a secondary antenna circuit, both tuned to identical resonant frequencies via variable inductors. The receiver only "heard" energy that matched its own resonance — a passband filter built from coils, capacitors, and physics. Marconi called it "syntonic" tuning, from the Greek for "together-toned."

The patent was viciously contested. Oliver Lodge had demonstrated tuned resonance in 1897. Nikola Tesla's US 645,576 (1900) covered four-circuit tuning for power transmission. John Stone Stone held related filings. The US Supreme Court eventually invalidated key Marconi claims in Marconi Wireless v. United States (1943), restoring priority to Tesla, Lodge, and Stone — a posthumous correction for Tesla, who had died months earlier. But by then the engineering principle was woven into every radio on Earth.

The modern relevance is staggering. Every concept underlying modern spectrum allocation traces back to syntonic tuning:

FM radio stations at 88.5, 89.1, 89.7 MHz — different LC resonances, same antenna, no interference.
Wi-Fi channels 1, 6, and 11 in the 2.4 GHz band — tuned bandpass filters at the front end of every router and laptop.
Cellular bands — 4G LTE, 5G NR, mmWave — every handset contains dozens of tuned filters (SAW, BAW, FBAR) that are direct descendants of Marconi's LC circuits, just shrunk to micron scale on piezoelectric substrates.
Software-defined radio moves the tuning into DSP, but the conceptual model — "select energy at frequency f, reject everything else" — is unchanged since 1900.
The very idea of a "channel" — that the electromagnetic spectrum can be subdivided, licensed, auctioned, regulated by the FCC and ITU — is a legal and economic structure built atop Marconi's physical insight.

The 2021 US C-band auction raised $81 billion for 280 MHz of spectrum between 3.7 and 3.98 GHz. That auction only exists because tuned circuits let buyers actually use their slice without interference. Without syntonic tuning, spectrum has no boundaries — and therefore no value.

Could it be built better now? It already has been: modern smartphones contain 50+ filters in the RF front end, each performing Marconi's trick at gigahertz frequencies with quality factors approaching 10,000. But the principle — match the resonance, reject the rest — is verbatim from patent 7777.

Key Takeaway: Marconi's 1900 "four sevens" patent didn't invent radio — it invented the idea that radio could have channels, transforming a chaotic shouting match into the structured, licensable spectrum that underpins every wireless technology in existence.

Daily GitHub Zero Stars

Engine-Room-Games/ERG-Generators

2026-05-22

Language: ShaderLab

Link: https://github.com/Engine-Room-Games/ERG-Generators

This repository is a Unity package containing a collection of Roslyn source generators aimed at eliminating the repetitive boilerplate that plagues Unity gameplay programming. Source generators are a compile-time C# feature that produce code automatically based on attributes or patterns in your project — and applying them to common Unity gameplay patterns is a genuinely clever idea that more studios should be exploring.

Anyone who has shipped a Unity title knows the pain points: writing the same SerializedField / property accessor pairs, manually wiring up state machines, hand-rolling component lookups, or maintaining brittle event subscription cleanup code. These are exactly the kinds of patterns that source generators can collapse into a single attribute or interface marker, with zero runtime cost.

What's interesting about this particular repo:

Compile-time, not reflection-based — unlike Unity's typical reflection tricks, source generators produce real, debuggable C# you can step through.
Targeted at gameplay patterns specifically — most Roslyn generator libraries focus on serialization or DI. Tackling gameplay-loop boilerplate is a less-explored niche.
Distributed as a Unity package — meaning it's drop-in installable via UPM, which is the friction-free path Unity devs actually want.
ShaderLab listed as primary language is amusing (likely just a sample scene tipping the detection), but the real meat is C# generator code.

It would be most useful to indie Unity developers and small studios who don't have a dedicated tools programmer and end up writing the same scaffolding game after game. It's also a great learning resource for anyone curious about how to build their own Roslyn source generators — gameplay code is a forgiving sandbox for experimenting with the API.

Zero stars currently, but the concept has real legs. Worth bookmarking even if you don't use Unity, just as an example of applying modern .NET tooling to game development workflows.

Why check it out: A pragmatic application of Roslyn source generators to kill Unity gameplay boilerplate — niche, well-scoped, and genuinely useful.

Daily Hardware Architecture

The Commit Buffer and Architectural State Update: How CPUs Make Speculation Permanent

2026-05-22

Modern CPUs execute hundreds of instructions speculatively, but only a fraction of that work ever becomes real. The commit buffer (sometimes folded into the retirement unit but distinct in many designs) is the hardware that performs the final atomic flip from "speculative result sitting in a physical register" to "architectural state visible to the world." It's the place where the CPU's lies become truth.

The flow looks like this: an instruction retires from the ROB, but its side effects on memory and visible state happen at the commit stage. Three things must happen atomically per instruction:

Architectural register update: The rename map's "committed" pointer for the destination logical register flips to the new physical register.
Store release: Stores graduate from the store buffer to L1 (or write-combining buffers for WC memory).
Exception/event delivery: If the instruction took a fault, the commit buffer is where the CPU stops pretending it didn't happen and jumps to the handler.

Modern wide cores commit 4-8 instructions per cycle. Intel's Golden Cove retires up to 8/cycle; AMD Zen 4 retires 8/cycle; Apple's M-series can hit 8+. The commit buffer needs that bandwidth because if commit is narrower than dispatch, the ROB fills up and the front end stalls.

Concrete example — the AVX-512 store storm: Imagine a loop doing 64-byte stores to memory. Each store sits in the store buffer until commit. Skylake-X has a ~56-entry store buffer and commits up to 2 stores/cycle to L1. If your loop generates stores faster than 2/cycle sustained, the store buffer fills, dispatch stalls, and you've found a commit-bandwidth bottleneck even though arithmetic ports are idle. This is why rep stosb on modern Intel uses a fast-string microcode path that bypasses normal store-buffer commit entirely.

Rule of thumb: Sustainable IPC is bounded by min(decode width, issue width, commit width). On Zen 4 that's min(6, 10, 8) = 6 — decode is the bottleneck. On Golden Cove it's min(6, 12, 8) = 6 — same story. Commit width almost always exceeds sustained IPC, which is intentional: commit must absorb bursts when many short-latency instructions finish together.

Faults make commit interesting. A page fault detected during execution sits dormant in the ROB entry until that instruction reaches the commit point — only then does the CPU flush younger instructions and redirect. This is why Meltdown worked: the speculative load executed and touched cache before commit ever saw the fault.

See it in action: Check out We Woke Up on a Generation Ship Where Everyone Else Had Forgotten Earth by No Sleep Story to see this theory applied.

Key Takeaway: The commit buffer is the atomic boundary where speculative work becomes architectural reality — its width sets the ceiling on sustained throughput, and its in-order discipline is what makes precise exceptions possible.

Hacker News Deep Cuts

Phantasy Star IV – 1993 Developer Interviews

2026-05-22

Link: https://shmuplations.com/phantasystariv/

HN Discussion: 1 points, 0 comments

Shmuplations is one of the quiet treasures of the English-language internet: a labor-of-love archive translating Japanese developer interviews from gaming magazines that would otherwise be lost to time. This particular piece covers Phantasy Star IV: The End of the Millennium, the 1993 Sega Genesis RPG that closed out one of the most ambitious console RPG series of the 16-bit era.

Why should a technical audience care about a 33-year-old JRPG postmortem? A few reasons:

Constraint-driven engineering. Phantasy Star IV shipped on a 6 Mbit cartridge (later expanded) targeting a 7.6 MHz Motorola 68000 with 64KB of RAM. The developers had to fit an epic, multi-continent RPG with a comic-panel cutscene system, animated combat macros ("Combination Techniques"), and a sprawling script into a footprint smaller than a single modern JPEG. The interview likely surfaces the tradeoffs — what got cut, what got compressed, what tricks the team used.
The "macro" combat system. PSIV pioneered programmable combat macros, letting players script multi-turn party behavior. It's a UX pattern that quietly anticipates modern auto-battler design and even MMO rotation tooling.
Production-history primary source. These are direct quotes from director Rieko Kodama (who passed in 2022) and her team. Kodama was one of the rare women leading console RPG development in the early 90s, and her perspective on the industry is genuinely rare archival material.
Translation as preservation. Shmuplations itself is an interesting project — a single translator (with occasional collaborators) doing the work that no studio, publisher, or museum has bothered to do. The site is a quiet case study in how much computing history exists only in print, in another language, and is one motivated person away from being lost entirely.

For anyone who works on games, retro hardware, emulator development, or just appreciates engineering under absurd constraints, these interviews are gold. The fact that this story has one point and zero comments is the kind of curation failure HN's front page algorithm produces regularly — the cost of a system that rewards what looks new over what is enduring.

Why it deserves more upvotes: A rare primary-source interview with the team behind a landmark 16-bit RPG, preserved by a one-person translation project that itself deserves wider recognition.

HN Jobs Teardown

Vocdoni: What Their Hiring Reveals

2026-05-22

Source: HN Who is Hiring

Posted by: ferran_vocdoni

Of every posting in this thread, Vocdoni's is the most strategically revealing — not because of what it says about Vocdoni, but because of what it implies about the maturation (and contradictions) of crypto-adjacent civic tech in 2020.

The stack tells the whole story. They list Ethereum, Tendermint, ZK-SNARKs, Golang, Flutter, and TypeScript. That is an unusually opinionated bill of materials:

Ethereum + Tendermint together is telling — they don't trust a single chain. Tendermint (the BFT consensus engine behind Cosmos) suggests they're running their own sovereign chain for vote tallying, with Ethereum as a settlement/anchoring layer. Classic "L2 before L2 had a name" architecture.
ZK-SNARKs in 2020 is genuinely bleeding-edge. Most teams were still arguing about whether zero-knowledge proofs were production-ready. Vocdoni is betting yes — because anonymous voting requires it.
Flutter for the frontend is the surprise. They want one codebase shipping to iOS, Android, and web simultaneously. That's a small-team tell: they cannot afford three platform specialists.

What the posting reveals about stage and direction. They're hiring a single frontend developer, remote worldwide, with "all code is free open-source." This is a foundation/grant-funded shop, not a VC rocket ship. The breadth of buzzwords ("anonymous voting," "participation platform," "sovereign identity") in a single sentence is a red flag for product focus — they're trying to be three things at once, which is common in crypto civic-tech where the funding follows narrative breadth.

Skills/trends highlighted. The job demands Reactive frameworks + responsive design + Flutter + backend data manipulation. That's a generalist senior, not a junior. The phrase "capacity to learn Flutter" is the giveaway: they know the Flutter talent pool is tiny and they're willing to train. In 2020 this was a leading indicator — Flutter hiring exploded over the next two years.

Red flags: No salary band. No team size. "REMOTE Worldwide" with no timezone constraint often means async-everything chaos. The pitch leads with technology, not problem — a common failure mode for crypto projects where the team is more excited about the stack than the user.

Green flags: Genuinely open-source (verifiable claim, not marketing). Coherent technical thesis — every stack choice serves the anonymous-voting use case. Hiring a frontend dev specifically suggests the protocol layer is stable enough that UX is now the bottleneck, which is a healthy progression for a protocol company.

The signal: When a crypto project's job posting prioritizes Flutter and frontend polish over more protocol engineers, the protocol war is over and the usability war has begun.

Daily Low-Level Programming

The Line Fill Buffer: How the CPU Tracks Cache Misses In Flight

2026-05-22

When your load misses the L1 cache, the CPU doesn't stall the whole pipeline — it allocates a Line Fill Buffer (LFB) entry, records the miss, and keeps executing. The LFB is the hardware structure that tracks every outstanding cache-line request between L1 and the rest of the memory hierarchy. On Intel Skylake-through-Raptor Lake cores there are 10–12 LFBs per core; AMD Zen calls them "Miss Address Buffers" and has 22. When all LFBs are occupied, new misses block — even if the ROB, load queue, and execution units are idle.

Each LFB entry holds the 64-byte cache line being fetched, the physical address, status bits (valid, in-flight, write-combining), and a list of pending loads/stores waiting for that line. Crucially, multiple loads to the same cache line share one LFB — this is called miss-under-miss coalescing. Loads to different lines each need their own.

Why this is the real ceiling on memory bandwidth: Little's Law applies directly. Bandwidth = (concurrent requests) × (line size) / latency. With 12 LFBs, a 64-byte line, and ~80 ns DRAM latency:

12 × 64 / 80e-9 = ~9.6 GB/s per core for pointer-chasing workloads

Your DDR5 modules can deliver 50+ GB/s, but a single core physically cannot ask for more than 12 lines at once. This is why memcpy() benchmarks scale across cores but linked-list traversal doesn't — the LFB count, not DRAM, is the bottleneck.

Real-world example: A hash table with chained collisions. Each probe is a dependent load (you can't issue the next until the current pointer resolves), so only 1 LFB is busy at a time → ~12 ns per probe (L2 hit) or ~80 ns (DRAM). Now switch to open addressing with linear probing: the prefetcher detects the stride, fills 8+ LFBs in parallel, and your effective latency drops to ~10 ns per probe even on misses. Same algorithm complexity, 8× throughput, entirely because the LFBs are working in parallel instead of serially.

Diagnosing it: The perf counter l1d_pend_miss.fb_full counts cycles where a load wanted an LFB but none was free. If this is >5% of cycles, you're LFB-bound. Look for: long dependency chains on pointer loads, code that touches many distinct cache lines per loop iteration, or NT-stores competing with regular loads (write-combining buffers share LFB resources on some microarchitectures).

Rule of thumb: A single core can sustain at most ~10 outstanding cache misses. Design data structures so independent misses can issue together — that's where the parallelism lives.

See it in action: Check out This fix can solve your Spider-Man 2 performance issues 🔧 #shorts by Constant Jitters to see this theory applied.

Key Takeaway: The Line Fill Buffer caps a single core at ~10 concurrent cache misses, making memory-level parallelism — not DRAM bandwidth — the real bottleneck for pointer-heavy workloads.

Reddit Small Subs

Dividing via multiplicative inverse on RISC-V

2026-05-22

Subreddit: r/asm

Discussion: View on Reddit (0 points, 1 comments)

This post links to a deep technical write-up on one of the oldest tricks in the compiler-optimization playbook: replacing integer division with multiplication by a precomputed "magic number." The article walks through how it's done on RISC-V, but the underlying technique applies anywhere — it's the same transformation that GCC, LLVM, and every serious compiler perform silently when you write x / 7 with a constant divisor.

The core idea: division is brutally slow on most CPUs (often 20–40+ cycles, often not pipelined), while multiplication is fast and pipelined (3–5 cycles). If the divisor is a known constant, you can mathematically rewrite n / d as (n * m) >> s, where m is a carefully chosen "multiplicative inverse" and s is a shift amount. The hard part is picking m and s so the result is exact for every possible n in the range — not just approximately right.

What makes this particularly interesting for r/asm readers:

RISC-V is a great teaching vehicle for this. Its instruction set is small enough that you can see exactly which ops the trick depends on — mulh (high half of a multiply), arithmetic shift, and add. No hidden microcode magic.
Signed vs. unsigned matters. The math for unsigned division is cleaner; signed requires an extra correction step because of how rounding-toward-zero interacts with arithmetic shifts of negative numbers.
It's a window into what compilers do for you. Disassemble any optimized C code that divides by a constant and you'll see this pattern. Recognizing it makes reading optimized output much less mysterious.

The technique traces back to Granlund & Montgomery's 1994 paper "Division by Invariant Integers using Multiplication," and Hacker's Delight devotes a full chapter to it. But seeing it implemented in modern RISC-V assembly — with a working derivation rather than just the formula — is the clearest way to actually internalize why the magic numbers work, not just what they are.

If you've ever wondered why x / 10 compiles to a multiply followed by a shift, this is the explanation.

Why read this: Demystifies the multiplicative-inverse trick that every optimizing compiler uses to turn slow constant divisions into fast multiplies and shifts.

RFC Deep Dive

RFC 3117: On the Design of Application Protocols

2026-05-22

RFC: RFC 3117

Published: 2001

Authors: Marshall T. Rose

Most RFCs define a protocol. RFC 3117 is different: it's Marshall Rose stepping back after two decades of designing application protocols (SMTP extensions, SNMP, BEEP, the Internet Message format) to write down the taste behind that work. It is an Informational RFC, not a standard, and reads more like an essay by a craftsman than a specification. If you have ever had to argue with a colleague about whether to use HTTP, gRPC, raw TCP, or roll your own framing, this is the document you wish they had read first.

Rose organizes the design space around a handful of axes that every application protocol implicitly answers, whether the designers realize it or not:

Connection lifecycle: connection-oriented vs. connectionless, and whether one connection carries one exchange (HTTP/1.0), many serialized exchanges (HTTP/1.1 keep-alive), or many interleaved exchanges (BEEP, HTTP/2, AMQP).
Framing: octet-stuffed delimiters (SMTP's lone .), length-prefixed records, or self-describing structures. Rose is blunt that delimiter-based framing is almost always a mistake — it forces escaping, and escaping forces parsers to scan every byte.
Encoding: textual (human-debuggable, expensive to parse, ambiguous about whitespace) vs. binary (compact, fast, opaque on the wire). He notes the textual-protocol religion is largely a historical accident of 7-bit telnet links.
Reporting: in-band status codes, out-of-band events, and whether errors are transient or permanent — a distinction SMTP got right with its 4xx/5xx split and that nearly everyone since has copied.
Asynchrony: can the server push? Can a client have multiple outstanding requests? Is ordering preserved per-channel or globally?

The document's most cited contribution is the section on common pitfalls. Rose enumerates the ways protocols rot: assuming the network is reliable, conflating identification with authentication, making the protocol depend on the transport's flow control instead of providing its own, building in versioning that nobody can actually use, and — his favorite target — putting policy into the protocol when it belongs in the application. The "robustness principle" (be liberal in what you accept) gets a careful re-examination: Rose argues that liberal acceptance creates de facto standards out of bugs, a critique that became orthodoxy two decades later when Postel's principle fell out of favor.

There's a wonderful section on scalability that distinguishes three independent axes — number of concurrent users, size of individual exchanges, and rate of exchanges per user — and points out that protocols optimized for one axis (say, HTTP for many short exchanges) often fail spectacularly on another (long-lived streaming). This framing predates and predicts every "WebSockets vs. long-polling vs. SSE vs. gRPC streaming" debate of the 2010s.

The backstory: Rose wrote this while developing BEEP (RFC 3080), a generic application-protocol framework that was supposed to be the substrate for everything from SYSLOG to network management. BEEP itself never caught on — HTTP ate the world instead — but RFC 3117 outlived its sponsor protocol because the analysis is independent of the conclusion. Read it today and you'll find a checklist that quietly grades HTTP/2, QUIC, gRPC, MQTT, and whatever JSON-over-WebSocket thing your team is currently shipping.

Why it matters: It's the closest thing the IETF has to a written design tradition for application protocols — a vocabulary for the decisions you're making whether you name them or not.

Stack Overflow Unanswered

Sync reset used as async reset

2026-05-22

Stack Overflow: View Question

Tags: verilog, fpga

Score: 0 | Views: 44

The asker sees code shaped like this:

always @(posedge clk or negedge sync_rst_n)
    if (!sync_rst_n) q <= 0;
    else             q <= d;

where sync_rst_n is the output of a two-flop synchronizer fed by an external asynchronous reset. Their intuition is that this looks wrong — you've gone to the trouble of synchronizing the reset, only to then expose it again as an asynchronous edge in the sensitivity list. Why bother synchronizing at all?

Why this is actually the canonical pattern. What's being described is the textbook reset synchronizer (sometimes called "asynchronous assertion, synchronous deassertion"). The motivation isn't to make the reset purely synchronous — it's to fix the one specific failure mode a fully async reset has: if the external reset is deasserted too close to a clock edge, individual flops in the design can come out of reset on different cycles, leading to metastability or inconsistent initial state. By passing reset deassertion through two flops clocked by clk, you guarantee that when downstream flops see sync_rst_n go high, it does so cleanly relative to clk, with enough recovery/removal margin.

Why the sensitivity list still lists negedge sync_rst_n. The assertion of reset should still be asynchronous: you want every flop in the design to drop into reset immediately, without waiting for a clock that may not be running (e.g., during power-up, PLL not locked, clock gated). Putting sync_rst_n in the sensitivity list preserves async assertion. The synchronizer only buys you clean deassertion.

Approach to verifying the pattern is correct:

Confirm the synchronizer flops themselves use async assertion from the raw reset (otherwise they can't come out of an unclocked state).
Confirm sync_rst_n fans out to every downstream flop's sensitivity list — mixing sync and async reset styles across the same clock domain is where trouble starts.
Apply timing constraints (e.g. set_false_path -from [raw_rst] -to [synchronizer_first_flop]) so STA doesn't try to time the async input, but still times recovery/removal on the synchronized output.

Gotchas: per clock domain you need a separate synchronizer — a single sync_rst_n shared across domains defeats the purpose. Xilinx UG949 and Cliff Cummings' SNUG 2003 paper "Synthesis and Scripting Techniques for Designing Multi-Asynchronous Clock Designs" are the standard references; both explicitly recommend this exact construct.

The challenge: The code looks contradictory but is actually the industry-standard reset synchronizer — async assertion for safety, synchronous deassertion for clean recovery timing.

Daily Software Engineering

The Leader Election Pattern: Picking One Node to Do the Singleton Work

2026-05-22

You scaled your service to five replicas for availability. Great. Now you need to run a nightly cleanup job, poll a third-party API every minute, or process a queue that demands strict ordering. If all five replicas do it, you get duplicate work, race conditions, and rate-limit violations. If you hardcode "only replica-0 does it," you've lost your availability the moment replica-0 dies.

Leader election solves this: the cluster agrees on exactly one node to perform the singleton work, and automatically picks a new one when that node dies.

How it works in practice. Every node tries to acquire a lease — a time-bounded lock in a coordination store (etcd, ZooKeeper, Consul, Redis with Redlock, or a database row with a TTL). The winner becomes the leader, periodically renews the lease, and does the singleton work. Losers sit in standby, retrying the acquisition. If the leader crashes, its lease expires, and a standby grabs it.

Concrete example. Kubernetes controllers use this constantly. The kube-controller-manager runs as multiple replicas for HA, but only one is "active." They race to update a Lease object in the API server every few seconds. If the active one stops renewing for ~15 seconds, another takes over. Your own services can do the same with the client-go/tools/leaderelection package — about 30 lines of setup.

The rule of thumb for lease duration. Pick a lease TTL that's roughly 3× your renewal interval, and a renewal interval that's 10× your typical network round-trip. So with a 50ms RTT to etcd, renew every 500ms, lease for 1.5s. Shorter = faster failover but more risk of flapping under load spikes. Longer = stable but slow recovery. A common production setting is renew=2s, lease=15s — accepting 15s of downtime to avoid false failovers during GC pauses.

The trap nobody warns you about: split-brain on long pauses. Suppose the leader gets stuck in a 20-second GC pause. Its lease expires, a new leader takes over, and then the old leader wakes up — still thinking it's the leader — and writes to your database. Two leaders, corrupted state.

The defense is fencing tokens: every lease comes with a monotonically increasing number. The leader sends that number with every write, and the storage layer rejects writes with a stale token. etcd and ZooKeeper provide this natively. If your store doesn't support fencing, leader election alone is not safe for writes — it's only safe for idempotent reads or work you can afford to do twice.

Key Takeaway: Leader election gives you a singleton in a redundant cluster — but without fencing tokens, a paused leader can wake up and corrupt the state a new leader has already started writing.

Tool Nobody Knows

parallel: GNU parallel's Tricks That Aren't Just xargs -P

2026-05-22

Everyone discovers xargs -P and thinks they've solved parallelism. Then they hit a job where they need per-input output isolation, resume-from-failure, or remote execution across a cluster of boxes, and xargs starts looking like a toy. GNU parallel has been quietly doing all of this since 2010, and most people use maybe 5% of it.

The killer feature nobody talks about is --joblog combined with --resume-failed. Run a batch of 10,000 jobs, half fail because S3 throttled you, fix the throttling, and re-run exactly the failures:

parallel --joblog jobs.log --resume-failed -j 16 \
  './upload.sh {}' ::: files/*.tar.gz

The joblog records exit codes, wall time, host, and the exact input. --resume skips successes; --resume-failed retries non-zero exits. xargs has nothing for this — you'd be writing your own state file.

Output never interleaves. Run noisy commands in parallel without garbled stdout:

parallel --line-buffer 'curl -s {} | grep -i error' ::: $(cat urls.txt)
# Or fully grouped (default) — each job's full output prints atomically when done
parallel 'pytest tests/{}' ::: unit integration e2e

The ::: input syntax composes like a Cartesian product, which xargs cannot do at all:

# Try every combination of compiler × optimization × file
parallel 'gcc -{2} {3} -o /tmp/{1}-{2} {3}' \
  ::: gcc-11 gcc-12 clang-15 \
  ::: O0 O2 O3 \
  ::: src/*.c

That's a 3 × 3 × N matrix expansion in one line. The substitution operators are surprisingly rich: {.} strips extension, {/} strips path, {//} keeps only path, {#} is the job number, {%} is the slot number (great for round-robin pinning to GPUs):

parallel -j 4 'CUDA_VISIBLE_DEVICES={%} python train.py {}' ::: configs/*.yaml

Remote execution is built in. If you have SSH access to a few machines, parallel will distribute work for you:

parallel -S host1,host2,host3 --transferfile {} --return {}.out \
  './process.sh {}' ::: data/*.bin

It rsyncs the input, runs the command remotely, copies the result back. No Ansible, no Kubernetes — three flags.

Progress and ETA you can actually read:

parallel --bar --eta 'convert {} {.}.webp' ::: photos/*.jpg
# Computers: 1:local / 4 / 1247
# 53% 663:584=15m32s 12.4s/job

The pitfall everyone hits: parallel buffers output to keep it clean, which means a stuck job has invisible output. Use --line-buffer when you want streaming progress, --tag to prefix each line with the input so you can tell jobs apart in interleaved mode, and --halt now,fail=1 to bail on first failure instead of grinding through 9,999 doomed jobs.

One more trick — --pipe turns parallel into a parallel tee for chunked stdin:

# Process a giant log file with 8 parallel workers, 1MB chunks
cat huge.log | parallel --pipe --block 1M -j 8 'grep -c ERROR'
# Sum the per-chunk counts; faster than single-threaded grep on multi-core

It's the closest thing Unix has to map-reduce in a single binary, and it's already installed on half the boxes you ssh into.

Key Takeaway: GNU parallel is xargs with a joblog, Cartesian inputs, remote execution, and stdin chunking — learn the four flags --joblog, :::, -S, and --pipe and you've replaced a dozen ad-hoc shell scripts.

What If Engineering

What If We Built a Mass Driver Up the Side of a Mountain to Launch Satellites?

2026-05-22

Rockets are absurd: 90% of liftoff mass is propellant burned to lift propellant. The mass driver dream — an electromagnetic catapult that simply throws payloads at orbital speed — has haunted aerospace since Gerard O'Neill sketched one in the 1970s. Let's build the most plausible version: a linear motor track laid up the eastern flank of Chimborazo (6,263 m), Ecuador's tallest peak and the point on Earth's surface farthest from its center.

The geometry. Run an evacuated tube 20 km long up a 15° grade, exiting near the summit aimed eastward to steal Earth's rotation (~465 m/s free at the equator). We need a muzzle velocity of about 7.8 km/s to reach low orbit, minus the rotational bonus, plus losses — call it 7.5 km/s.

The acceleration problem. Using v² = 2aL:

a = (7,500)² / (2 × 20,000) = 1,400 m/s² ≈ 143 g

That instantly disqualifies humans (and most electronics). But hardened artillery shells already survive 15,000 g at firing, so ruggedized fuel canisters, water, structural beams, and milspec satellites are fair game. Acceleration time: just 5.3 seconds.

The power bill. Kinetic energy of a 1,000 kg slug at 7.5 km/s:

KE = ½ × 1,000 × (7,500)² = 28.1 GJ

Delivered in 5.3 seconds means a peak draw of ~5.3 GW — five Hoover Dams during the launch pulse. Impossible to pull from the grid; the answer is a homopolar flywheel bank or a capacitor farm that trickle-charges between shots. At 90% efficiency and one launch per hour, average grid draw drops to a manageable 8.7 MW.

The atmosphere will try to murder your payload. Even at 6.3 km altitude, air density is ~0.66 kg/m³ (half sea level). For a 0.5 m² blunt projectile at Cd ≈ 0.3:

F_drag = ½ × 0.66 × (7,500)² × 0.3 × 0.5 = 2.78 MN

On a 1,000 kg slug, that's 283 g of deceleration — worse than the launch acceleration, and stagnation temperatures hit ~20,000 K. The projectile would shed half its kinetic energy and most of its nose before reaching space.

Two ways to survive:

Sabot + ablative nose cap: A tungsten-tipped shroud absorbs 5–10 seconds of plasma, sheds at 80 km, payload coasts the rest.
Onboard kick stage: Mass driver only provides ~5 km/s; a small solid rocket adds the final 2.5 km/s above the dense air. This is the SpinLaunch playbook, just linear instead of rotational.

Track engineering. 20 km of evacuated tube with superconducting coils every few meters — call it a horizontal LHC, but for FedEx. The accelerating force on the bucket is 1.4 MN; the rails must transfer that into the mountain without buckling. Anchoring requires roughly one Hoover Dam's worth of concrete spread along the bedrock, plus active cooling to handle the magnetic eddy-current heating.

Economics. At 28 GJ per shot, electricity costs about $230 per 1,000 kg launched. Compare to Falcon 9 at ~$2,700/kg. Even amortizing a $20B mountain installation across 100,000 launches, you arrive at ~$400/kg — a fivefold reduction, and zero propellant logistics.

Key Takeaway: A mountainside mass driver is energetically trivial and structurally feasible — but hypersonic atmospheric drag, not the catapult itself, is the wall that forces you to bolt a rocket onto the payload anyway.

Wikipedia Rabbit Hole

Rhenium

2026-05-22

Wikipedia: Read the full article

Rhenium is the last stable element to be discovered on Earth — found in 1925, a full 56 years after Mendeleev predicted it. Dmitri had penciled in a blank square between manganese and technetium and called the unknown occupant "dvi-manganese." It took chemists in Berlin painstakingly processing 660 kg of molybdenite ore to extract just one gram of the stuff. Today it remains one of the rarest elements in Earth's crust, with no concentrated ore of its own — it is scraped, almost as an afterthought, from the flue dust of copper-molybdenum smelters.

So why do we care about a metal so scarce it makes platinum look common? Because rhenium has the third-highest melting point of any element (3,186 °C, behind only tungsten and carbon) and the highest boiling point of any element, period. When you dissolve a few percent of it into a nickel-based superalloy, something remarkable happens: the alloy's resistance to creep — that slow, agonizing deformation metals suffer under sustained heat and stress — improves dramatically. The rhenium atoms park themselves at grain boundaries and refuse to budge, even as the surrounding crystal lattice begs to slide.

This is the secret behind modern jet engines. The single-crystal turbine blades inside a Rolls-Royce Trent or a GE9X spin in gas streams hotter than the blade's own melting point. They survive because:

They are grown as one continuous crystal, eliminating grain boundaries entirely
They contain 3–6% rhenium (second-generation blades) or even higher in third-generation alloys
Cool air is bled from the compressor and pumped through internal channels, film-cooling the blade surface

The economic consequence is brutal: roughly 70% of global rhenium production goes into turbine blades, and the aerospace industry is essentially the sole buyer. When General Electric announced its new alloys in the 2000s, rhenium prices spiked from around $1,000/kg to over $10,000/kg almost overnight. GE responded by launching one of the most aggressive recycling programs in metallurgy — used turbine blades are now treated like gold ore, with the rhenium chemically stripped and recycled. The company also developed "rhenium-lite" alloys to reduce dependency on a metal whose supply could be choked off by a single mine closure in Chile or Kazakhstan.

Here's where it gets weirder: rhenium is so rare that more of it exists in some meteorites than in equivalent volumes of Earth's crust. Geochemists use the rhenium-osmium isotope system to date the formation of ore deposits and even to fingerprint the age of Earth's continental crust. The very atoms keeping your transatlantic flight aloft were forged in stellar explosions and concentrated in trace amounts that we are now, somewhat absurdly, depending on for global aviation.

Down the rabbit hole: The element that lets jet engines exist is so rare that recycling old turbine blades is now more economical than mining new supply — and the entire aerospace industry is one geopolitical hiccup away from a crisis.

Daily YT Documentary

Rob Beckett talks about traditional blacksmiths | Traditional Ways | Short Documentary

2026-05-22

Channel: Joseph Maclachlan (2 subscribers)

Honestly, this batch is rough — most of the candidates are #shorts, fan dubs, or hashtag-spam clips with no real substance. The one genuine documentary attempt is this interview-style piece from a tiny channel (just 2 subscribers!) profiling a village blacksmith named Rob Beckett.

Traditional blacksmithing is one of those crafts that's been quietly disappearing for a century, but a small number of practitioners still keep the forge lit — making hand tools, gate hardware, horseshoes, and architectural ironwork using techniques that haven't fundamentally changed since the medieval period. A short interview segment with a working smith is a chance to hear, in their own words, how they think about heat colors, hammer control, the difference between mild steel and high-carbon stock, and what it's like to sustain a workshop economically in 2026.

A caveat: with only 2 subscribers and a brief description, this is almost certainly amateur-produced and possibly very short. But it's the only video in this list that's actually trying to document something — a real person doing a real skilled trade — rather than slap a hashtag salad on a movie clip. Worth a few minutes if you're curious about heritage crafts.

Why watch: A rare first-hand glimpse of a village blacksmith still practicing a centuries-old trade, from a channel so small you're probably one of its first viewers.

Daily YT Electronics

Water Tank Alert system using RTOS

2026-05-22

Channel: aarsh ahuja (1 subscribers)

Most "smart water tank" projects are Arduino + ultrasonic sensor + buzzer, wired together in a single loop(). This one reaches for something more interesting: an STM32F446RE running FreeRTOS, where the level sensing, alert logic, and user feedback are separated into discrete tasks coordinated by the kernel.

That framing matters. A water-tank monitor is the kind of small, bounded problem where you can actually see why an RTOS is useful — you have a slow sensor read, a time-critical alert, and a UI that all need to coexist without blocking each other. Doing it on bare-metal Arduino-style code forces you into fragile timer ISRs and state machines; doing it with FreeRTOS tasks, queues, and semaphores is the textbook example used to teach preemptive scheduling.

The STM32F446RE (Nucleo board) is also a solid choice — it's the same Cortex-M4 platform used in a lot of professional embedded curricula, so the techniques transfer directly to industrial work. Expect to see task creation, inter-task communication, and probably priority assignment for the alert path.

Caveat: this is a 1-subscriber channel and likely a student project, so production polish will be limited. But for someone learning the jump from Arduino's loop() to real RTOS-based firmware, watching a peer work through a concrete example is often more instructive than a polished tutorial.

Why watch: A small, concrete example of FreeRTOS task decomposition on STM32 — exactly the right scale to internalize why an RTOS beats a giant loop().

Daily YT Engineering

Crystal Defects Explained: Vacancies, Alloying, Solid Solutions & Materials Science

2026-05-22

Channel: STEPX Journal (123 subscribers)

This is the standout pick from a batch dominated by overlapping "intro to crystal structures" videos. Where most of the candidates stop at naming BCC, FCC, and HCP, this one tackles the more interesting follow-up question: why do real materials behave nothing like the textbook lattice diagrams?

The video focuses on imperfections — vacancies, interstitials, substitutional atoms, and the solid solutions that result when you deliberately mix species into a host lattice. This is the conceptual bridge between pure-element crystallography and actual engineering alloys. Without defects, you cannot explain diffusion, why work hardening exists, why brass is stronger than copper, or why a tiny percentage of carbon transforms iron into steel.

The framing is also pedagogically honest: the title leads with the counterintuitive idea that "perfect materials do not exist" and that microscopic flaws are what make materials useful, rather than treating defects as failures. That's the right mental model for anyone moving past first-year materials science.

STEPX Journal is a small channel (123 subs) but the topic selection here is sharper than the generic BCC/FCC explainer two slots above it. If you only watch one video from this list, the defects angle gives you the most leverage — once you understand point defects and solid solutions, the rest of alloy design starts making sense.

Why watch: Defects and solid solutions are where crystallography stops being trivia and starts explaining real alloy behavior — this video targets exactly that conceptual bridge.

Daily YT Maker

How to Build a DIY Manual Rebar Bending Jig

2026-05-22

Channel: Tech DIY Hacks (8730 subscribers)

Rebar bending is one of those tasks that looks trivial until you try to do it consistently by hand — and then you realize that every stirrup, hook, and L-bend in a reinforced concrete project depends on producing the same angle, the same radius, and the same leg length over and over again. This video walks through the construction of a manual bending jig that turns that finicky process into a repeatable one, using nothing more than a steel plate, a couple of strategically placed pegs, and a lever arm.

What makes the build worth watching is the geometry. The placement of the fulcrum peg versus the forming peg determines the inside bend radius — too tight and you'll crack the rebar, too loose and your stirrups won't fit the form. The video shows how to size those pegs against the rebar diameter (the rule of thumb is roughly 4× the bar diameter for the inside radius on grade 60 rebar) and how a fixed stop lets you reproduce the same angle without measuring each time.

It's a practical introduction to jig design in general: locate, constrain, repeat. The same principles apply whether you're bending steel, drilling dowel holes, or routing identical parts. For anyone pouring footings, building a small shop, or just curious about how reinforced concrete actually gets made, this is a solid hour of fundamentals.

Why watch: A clear lesson in jig geometry that turns inconsistent hand-bending into precise, repeatable rebar shapes.

Daily YT Welding

ArcDroid Plus Power Upgrade Install | Easy Fix for Plus Controller Power Issues

2026-05-22

Channel: 6.8liter (912 subscribers)

Of the candidates this week, most are either hashtag-spam product shots (the repeated "custom plasma cutting" uploads from Wilson Iron and Wood), DXF file marketing reels, or vague tool-review teasers. This ArcDroid video is the only one that actually teaches a specific, repeatable procedure: installing the power upgrade board that the newer ArcDroid Plus controller requires to handle its higher current draw.

If you own a portable CNC plasma table — and the ArcDroid is one of the more popular hobbyist/small-shop units out there — this is exactly the kind of factory-recommended retrofit that gets glossed over in official documentation. The video walks through why the upgrade is needed (the Plus controller pulls more power than the original board was designed to deliver, leading to brownouts and erratic behavior), what the new board does in the circuit, and the actual install steps inside the controller enclosure.

It's a niche topic, but that's precisely what makes it valuable: when your machine starts misbehaving after a controller swap, a clear walkthrough from someone who has done the fix is worth more than a dozen forum threads. The presenter also addresses the symptoms owners are likely seeing, which helps you diagnose whether you actually need the upgrade before tearing into your hardware.

Why watch: A focused, practical retrofit walkthrough for ArcDroid Plus owners experiencing power-related controller issues.

All newsletters

Daily Digest — 2026-05-22

In this digest

The Convair X-11 and X-12 (SM-65 Atlas Testbeds) Aside — Today's Topic: The Lockheed CL-1201: The Nuclear-Powered Flying Aircraft Carrier Lockheed Actually Designed in 1969

DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback

Oil Galleries and Pressurized Lubrication Routing

Java's Arrays.asList(int[]) Trap: The List of One That Looks Like Four

The Bug

The Fix

Static Timing Analysis Corners: Why Hardware Engineers Sign Off Chips Across Multiple Universes

Triangle Wave Generators: The Integrator-Comparator Loop

PWM (Pulse Width Modulation): How Digital Signals Control Analog Power

The 1935 Prescription Modern Parents Keep Forgetting: Let Children Be Bored

The Lost Art of Bathing Photographic Plates in Dye to See New Colors

Guglielmo Marconi's "Tuned Circuits" Patent: The 1900 Filing That Invented Wireless Channels — and Made the Radio Spectrum Usable

Engine-Room-Games/ERG-Generators

The Commit Buffer and Architectural State Update: How CPUs Make Speculation Permanent

Phantasy Star IV – 1993 Developer Interviews

Vocdoni: What Their Hiring Reveals

The Line Fill Buffer: How the CPU Tracks Cache Misses In Flight

Dividing via multiplicative inverse on RISC-V

RFC 3117: On the Design of Application Protocols

Sync reset used as async reset

The Leader Election Pattern: Picking One Node to Do the Singleton Work

parallel: GNU parallel's Tricks That Aren't Just xargs -P

What If We Built a Mass Driver Up the Side of a Mountain to Launch Satellites?

Rhenium

Rob Beckett talks about traditional blacksmiths | Traditional Ways | Short Documentary

Water Tank Alert system using RTOS

Crystal Defects Explained: Vacancies, Alloying, Solid Solutions & Materials Science

How to Build a DIY Manual Rebar Bending Jig

ArcDroid Plus Power Upgrade Install | Easy Fix for Plus Controller Power Issues

Java's `Arrays.asList(int[])` Trap: The List of One That Looks Like Four