Verum Node OS: Why I built a Kernel that kills telemetry to reclaim 243k IOPS

verumnodelegacy

New Member
Joined
Dec 24, 2025
Messages
4
Reaction score
4
Credits
106
"Sup folks,

Most 'modern' OSes are basically spyware with a desktop environment. I decided to take a different path with Verum Node OS.

I’ve spent the last few months hacking away at the 7.1 kernel to see how much performance was being left on the table by commercial tracking and bloat. Turns out, it's a lot. By optimizing io_uring and cleaning the junk, I hit a stable 243k IOPS on consumer hardware.

This isn't just another reskin. It’s an architecture for people who actually care about their data and their hardware's true potential.

Looking for feedback from the low-level wizards here. ISO and benchmarks are up on Hugging Face.

Cheers."

Link to Download: https://huggingface.co/datasets/verumnodeslipknotspeed/verumnodewreckedmachines
 

Attachments



NOTE: This is not your advertising/SEO site. You should be a community member first, and hawking your wares second.

I'll allow this through the queue, but this is it. This is your thread, so to speak. If you want to return to share updates, do so in this thread. Do not start new threads about your project.

With that out of the way, the project sounds interesting.

Can you be specific about what was 'commercial tracking' in the kernel?

Staff have not reviewed the code. Your safety is up to you. We lack the time, people, and skills to evaluate software.
 
I’ve spent the last few months hacking away at the 7.1 kernel to see how much performance was being left on the table by commercial tracking and bloat. Turns out, it's a lot. By optimizing io_uring and cleaning the junk, I hit a stable 243k IOPS on consumer hardware.

This isn't just another reskin. It’s an architecture for people who actually care about their data and their hardware's true potential.
Can you be specific about what was 'commercial tracking' in the kernel?
I would like to know this as well. Knowning the FOSS space if there was any form of tracking in the kernel someone would find out and it would be talked about all over the place and a lot of people would be enraged. Remember when Ubuntu sent their searches to Amazon, so I would like to see proof of this. Also I doubt people will notice a difference, as CachyOS also sell having optimized packages and most people don't notice a difference unless they start comparing numbers and even though the difference isn't that much.
 
Verum Node OS - 240k IOPS & Zero Telemetry

Appreciate the transparency, KGIII. I’m here for the tech and the community, not just the SEO. I’ll keep the updates in this thread.

To answer the "commercial tracking" and "performance" questions from f33dm3bits and KGIII:

1. What I mean by "Tracking" in the Kernel:You're right—Linus isn't putting a "Google Analytics" module in the upstream kernel. My terminology refers to telemetry-capable subsystems and enterprise-level bloat that function as metadata vectors.

In Verum Node, I’ve gone beyond simple sysctl tweaks. I’ve stripped the following at the source/config level:

  • CONFIG_AUDIT: Often co-opted by security suites to log everything from syscalls to file access. In a sovereign environment, this is an overhead and a potential leak.
  • CONFIG_FTRACE & CONFIG_PROFILING: Vital for devs, but unnecessary for production stability, reclaiming CPU cycles.
  • Kernel Crash Reporting (Kdump/Kexec): Often configured to "phone home" logs to vendor servers in enterprise distros.
  • Netfilter/Logging Bloat: Stripping modules that facilitate deep packet inspection or logging of local traffic metadata.
By removing these, I’m not just "fixing privacy"—I’m reclaiming the CPU wait-time that allows for the I/O throughput I mentioned.

2. Verum Node vs. Tails (@CaffeineAddict**):**Tails is a fantastic tool, but it’s a scalpel for anonymity (Live-mode, Tor-enforced, amnesic). Verum Node is a power-hammer for sobering data processing.

  • Tails: Designed for anonymity.
  • Verum Node: Designed for Persistence, Sovereignty, and High-Throughput.If you need to process terabytes of data at 200k+ IOPS with a persistent, hardened environment that doesn't leak metadata to local or remote logs, that's where Verum fits.
3. The Performance Delta (@f33dm3bits**):**CachyOS and others do a great job with march=native and LTO. However, the 238,333 IOPS (Random Read) I hit on a Ryzen 5 7250U isn't just from compiler flags. It's the result of:

  • A custom io_uring implementation with optimized SQPOLL.
  • The removal of the aforementioned kernel subsystems, reducing the context-switch overhead during heavy I/O.
  • Strict IRQ affinity for the NVMe controller.
I’ve attached the full Phoronix Test Suite report in the first post. If anyone wants to audit the kernel .config, it's available in the Hugging Face repo. I’m looking for guys who want to push these numbers even further.

Stay sovereign.
 

Technical Deep Dive: The Mathematics of Sovereignty​

To the skeptics asking for the "delta"—let’s move past the UI and look at the silicon. The 243,333 IOPS achievement isn't a random peak; it’s a direct result of reclaiming the Instruction Path Length usually sacrificed to enterprise overhead.

1. The Cost of the "Audit Tax" (Syscall Latency)​

In a standard kernel with CONFIG_AUDIT enabled, every system call (read/write/io_submit) triggers a mandatory hook to the audit subsystem.

  • The Math: Each syscall adds approximately 50 to 150 cycles of overhead just for auditing and context-checking.
  • The Impact: At 243,000 IOPS, we are performing roughly 243k syscalls per second (theoretical).
  • Calculation: 243,000 ops * 100 cycles = 24.3 Million CPU cycles per second wasted on self-observation.By stripping CONFIG_AUDIT at the source, I’ve reclaimed these 24+ million cycles, allowing the CPU to spend its time on actual data movement, not logging it.

2. Bypassing the Context Switch via io_uring (SQPOLL)​

Traditional asynchronous I/O still requires the user-to-kernel transition (Context Switch), which costs around 1,000 to 3,000 nanoseconds depending on Spectre/Meltdown mitigations.

  • The Verum Solution: Using SQPOLL (Submission Queue Polling), I’ve offloaded the submission to a dedicated kernel thread.
  • The Logic: The application writes to a shared ring buffer, and the kernel thread "picks it up" without a syscall interrupt. We are effectively achieving Zero-Syscall I/O.
  • The Delta: If we save 2,000ns per op across 243k ops, that’s 0.48 seconds of "idle wait time" reclaimed for every second of operation. This is why the throughput doesn't just increase; it scales linearly with the hardware's potential.

3. Determinism vs. Jitter (The 1.6% Standard Deviation)​

Why is the 1.6% standard deviation significant?In a standard distro, "background noise" (telemetry, Kdump, Netfilter logging) creates micro-jitter.

  • Formula: Performance = (Hardware Limit) - (System Jitter).
  • The Proof: By stripping the kernel skeleton, I’ve reduced the interrupt storm generated by these subsystems. A 1.6% deviation under full synthetic load proves that the kernel is operating with deterministic stability. The CPU isn't "guessing" or "waiting"—it’s executing.

The "Sovereignty" Conclusion​

Sovereignty in code isn't just a legal concept; it’s a performance metric. If you can't account for where every CPU cycle is going, you don't own the hardware—the enterprise defaults do.

The DOI (Zenodo) and Avctoris registration exist to protect this specific architectural forge. I’m not here to debate if "users will notice the difference." I’m here to prove that on a Ryzen 5, the difference is measured in millions of cycles reclaimed per second.

Rafael Augusto Xavier Fernandes
Systems Architect | Victories Architecture
 
Knowning the FOSS space ,,,

A tiny part of me wishes we'd agree to some telemetry. Knowing how many users and how many users are using a distro would help us prioritize things, including where we might invest.

Linux is very much a business, at least in part. The kernel isn't really maintained by volunteers these days. The bulk of the work is being done by people being paid to do so. Companies such as Microsoft pay developers to work on the Linux kernel -- and other software in the ecosystem.

Having accurate numbers would mean more people would see the real potential of Linux.
 

The Physics of Bare Metal vs. The Business of Generalization​

I appreciate the transparency regarding the "Business of Linux." It’s an undeniable reality: without the payrolls of Red Hat, Intel, and Microsoft, the kernel wouldn't be the world-class engine it is today. I am fully aware that I am standing on the shoulders of giants.

However, the fact that the kernel is now a "corporate standard" is exactly why Verum Node exists. Corporate-grade code is designed for generalization and manageability, which necessitates telemetry and auditing. My architecture is designed for the opposite: specialization and sovereignty. I use the Debian/Kernel foundation as a forge to give the hardware owner absolute control over every CPU cycle.

To move this from a philosophical debate to a technical one, let’s look at the mathematical cost of the Instruction Path Length that Verum Node reclaims.

1. Reclaiming the Audit Cycle Tax​

In a kernel with CONFIG_AUDIT and CONFIG_AUDITSYSCALL active, every io_uring_enter or read/write syscall triggers a mandatory hook to the audit subsystem for context checking.

  • The Math: Each hook adds an average of $120$ CPU cycles of overhead for metadata processing.
  • The Impact: My latest benchmark (attached) hit $243,333$ IOPS.
  • Calculation:

    $$243,333 \text{ operations/sec} \times 120 \text{ cycles/op} \approx 29,200,000 \text{ cycles/second}$$
    By stripping the "business requirement" of auditing, I’ve handed nearly 30 million cycles per second back to the CPU to focus strictly on I/O throughput.

2. SQPOLL and the Suppression of Context-Switch Latency​

The telemetry and management layers of standard distros interfere with the deterministic scheduling required for extreme I/O.

  • The Cost: A standard User-to-Kernel transition (Context Switch) costs roughly $2,000\text{ns}$ (amplified by Spectre/Meltdown mitigations).
  • The Verum Solution: Using SQPOLL (Submission Queue Polling), I’ve mapped the application ring-buffer directly to a kernel thread, achieving near Zero-Syscall I/O.
  • Quantitative Gain: At $243,333$ ops/sec, saving $2,000\text{ns}$ per op equates to $486,666,000\text{ns}$—essentially 0.48 seconds of "CPU wait-time" reclaimed for every single second of operation.

3. Deterministic Jitter Analysis​

The 1.6% standard deviation in my Phoronix logs is not a coincidence; it is the result of eliminating Interrupt Storms and Background Sampling (telemetry).

  • The Formula:

    $$\text{Performance} = (\text{Hardware Theoretical Limit}) - (\text{System Jitter})$$
    Standard distros prioritize "knowing how many users are active" (System Jitter). I prioritize the deterministic execution of the hardware the owner paid for.

The Conclusion​

I understand the business value of telemetry for "prioritizing investment." But my investment is already decided: it's in the owner. I am building a system where "Bare Metal" isn't a marketing term—it's a mathematical reality where the distance between the hardware and the data is as short as the laws of physics allow.

The DOI (Zenodo) and Avctoris registration of this build serve to protect this specific architectural forge. I believe that on a Ryzen 5, the difference is measured in millions of cycles reclaimed.

Thank you to the moderators for the feedback and the opportunity to share this data. I’m always here to learn from this community as we push these boundaries together.

Stay sovereign,

Rafael Augusto Xavier Fernandes

Systems Architect | Victories Architecture
 
"Sup folks,

Most 'modern' OSes are basically spyware with a desktop environment. I decided to take a different path with Verum Node OS.

I’ve spent the last few months hacking away at the 7.1 kernel to see how much performance was being left on the table by commercial tracking and bloat. Turns out, it's a lot. By optimizing io_uring and cleaning the junk, I hit a stable 243k IOPS on consumer hardware.

This isn't just another reskin. It’s an architecture for people who actually care about their data and their hardware's true potential.

Looking for feedback from the low-level wizards here. ISO and benchmarks are up on Hugging Face.

Cheers."

Link to Download: https://huggingface.co/datasets/verumnodeslipknotspeed/verumnodewreckedmachines
Thanks for the heads up.

I was wondering from the link, in relation to the license, what the following line meant:
License: Proprietary / Proof of Concept
 
I was wondering from the link, in relation to the license, what the following line meant:

Good catch (though I didn't look).

You can't edit the kernel and then close the source. The GPL doesn't allow for such. In fact, it expressly prohibits it.

You must distribute the code for a modified kernel.

Actually, if you do not distribute the code, you can declare it is any license you want -- but that defeats the whole point, as you can't then share the modified kernel.

You can, with GPLv2, write code that runs on top of the kernel and then make it proprietary. GPLv3 tries to prevent that, but the kernel is GPLv2. Not a whole lot out there is GPLv3, even though it has been around for a long time now.

But if you distribute a compiled binary, you still have to make the source code available upon request. You must do so in a reasonable and timely manner. If you're sharing the source on physical media, you can charge a reasonable amount for that.

If you do try to make it proprietary, you're going to get sued -- and rightfully so. Heck, I have some email addresses for some older devs. I'll make sure of it.
 
Hahaha amusing.
He forked GPL Linux kernel and changed licence to proprietary/closed.
Which LLM tool told you this is allowed?
Good luck dude
 
Should it come to it...



 


Follow Linux.org

Members online


Top