March 2026

Revisiting Model Stitching In the Foundation Model Era

digitado ⋅ 17 de March de 2026

arXiv:2603.12433v2 Announce Type: new Abstract: Model stitching, connecting early layers of one model (source) to later layers of another (target) via a light stitch layer, has served as a probe of representational compatibility. Prior work finds that models trained on the same dataset remain stitchable (negligible accuracy drop) despite different initializations or objectives. We revisit stitching for Vision Foundation Models (VFMs) that vary in objectives, data, and modality mix (e.g., CLIP, DINOv2, SigLIP 2) and ask: Are heterogeneous […]

Ver mais

Like 0

Liked Liked

technocracy

Optimizing Task Completion Time Updates Using POMDPs

digitado ⋅ 17 de March de 2026

arXiv:2603.12340v2 Announce Type: new Abstract: Managing announced task completion times is a fundamental control problem in project management. While extensive research exists on estimating task durations and task scheduling, the problem of when and how to update completion times communicated to stakeholders remains understudied. Organizations must balance announcement accuracy against the costs of frequent timeline updates, which can erode stakeholder trust and trigger costly replanning. Despite the prevalence of this problem, current approaches rely on static predictions or […]

Ver mais

Like 0

Liked Liked

technocracy

Weak-Driven Learning: Your discarded checkpoints can make your strong models stronger

digitado ⋅ 17 de March de 2026

We just released a paper with a finding that surprised us during our own training runs: weaker, earlier checkpoints of a model can actually drive further improvement in a strong model that has already saturated under standard SFT. The conventional wisdom is clear — weak models give you weak signal. Knowledge distillation flows from strong teacher to weak student. We found the opposite direction works too, and for a different reason. The problem we noticed: Once a model […]

Ver mais

Like 0

Liked Liked

technocracy

Parallel In-context Learning for Large Vision Language Models

digitado ⋅ 17 de March de 2026

Large vision-language models (LVLMs) employ multi-modal in-context learning (MM-ICL) to adapt to new tasks by leveraging demonstration examples. While increasing the number of demonstrations boosts performance, they incur significant inference latency due to the quadratic computational cost of Transformer attention with respect to the context length. To address this trade-off, we propose Parallel In-Context Learning (Parallel-ICL), a plug-and-play inference algorithm. Parallel-ICL partitions the long demonstration context into multiple shorter, manageable chunks. It processes these chunks in parallel and […]

Ver mais

Like 0

Liked Liked

technocracy

Disposable Apps Are Here: Where Does the Value Go When Anyone Can Clone Your Product?

digitado ⋅ 17 de March de 2026

AI just deleted the three moats that protected software for 40 years. Here’s what replaces them. I wrote about the coming wave of disposable software a while back. That piece was a prediction. Now, it’s a pattern. Here’s what’s actually happening – and where the moat moved. In early 2024, a small team launched an AI productivity app. Within 72 hours, a dozen clones were live on Product Hunt. Same prompts. Same interface. Same pitch. Some of them […]

Ver mais

Like 0

Liked Liked

technocracy

Software Values: Tech Stacks, Minimalism, and More

digitado ⋅ 17 de March de 2026

Values Values instill a goal of action to perform. The realist, by the endless winters of death, has fashioned new methods of action. The right way is the best way in any condition, and the best foot forward. What values do you consider important in developing your software? How to put into practise design-to-development principles in software development? Tech Stacks Software frameworks differ based on the purposes of the software to define the tech stacks. What programming languages […]

Ver mais

Like 0

Liked Liked

technocracy

Collaborative Temporal Feature Generation via Critic-Free Reinforcement Learning for Cross-User Sensor-Based Activity Recognition

digitado ⋅ 17 de March de 2026

Human Activity Recognition using wearable inertial sensors is foundational to healthcare monitoring, fitness analytics, and context-aware computing, yet its deployment is hindered by cross-user variability arising from heterogeneous physiological traits, motor habits, and sensor placements. Existing domain generalization approaches either neglect temporal dependencies in sensor streams or depend on impractical target-domain annotations. We propose a different paradigm: modeling generalizable feature extraction as a collaborative sequential generation process governed by reinforcement learning. Our framework, CTFG (Collaborative Temporal Feature Generation), […]

Ver mais

Like 0

Liked Liked

technocracy

The Hidden Tax of Cloud BI: Zombie Data Movement Between Platforms

digitado ⋅ 17 de March de 2026

Your dashboards may be cheap. The data movement behind them often is not. At first, the dashboard looked harmless. It was a simple operational report. Sales by store. n Last 90 days. n A few filters. The query ran in seconds. The warehouse computing cost was minimal. From the BI team’s perspective, everything looked efficient. But when the cloud billing report arrived at the end of the month, something strange appeared. Network transfer charges were increasing. Slowly at […]

Ver mais

Like 0

Liked Liked

technocracy

What the Heck is Apache Iggy?

digitado ⋅ 17 de March de 2026

Introduction If you follow the message streaming space at all, you know it is dominated by a few big names. Apache Kafka has been the 800-pound gorilla since LinkedIn open-sourced it back in 2011. I’ve covered streaming-adjacent projects a few times in this series, like WarpStream, Apache Paimon, and Proton. So when I started seeing chatter about a message streaming platform written from scratch in Rust that was processing millions of messages per second, I had to take […]

Ver mais

Like 0

Liked Liked

technocracy

The Hidden Failure Mode in Multi-Agent Review

digitado ⋅ 17 de March de 2026

A few days ago my review stage did the most dangerous thing a multi‑agent system can do: it looked like it worked. The UI showed progress. The pipeline marched forward. And yet one of the agents had effectively returned “nothing,” which meant my final decision was being computed from a lie—an average that quietly pretended a missing opinion existed. That’s the moment you stop thinking about “LLM evals” and start thinking about defensive systems engineering. This post is […]

Ver mais

Like 0

Liked Liked