technocracy

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

digitado ⋅ 12 de February de 2026

arXiv:2602.10224v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as an effective approach for enhancing the reasoning capabilities of Large Language Models (LLMs). Despite its efficacy, RLVR faces a meta-learning bottleneck: it lacks mechanisms for error attribution and experience internalization intrinsic to the human learning cycle beyond practice and verification, thereby limiting fine-grained credit assignment and reusable knowledge formation. We term such reusable knowledge representations derived from past errors as meta-experience. Based on this […]

Ver mais

Like 0

Liked Liked

technocracy

Red Skills or Blue Skills? A Dive Into Skills Published on ClawHub

digitado ⋅ 16 de April de 2026

arXiv:2604.13064v1 Announce Type: new Abstract: Skill ecosystems have emerged as an increasingly important layer in Large Language Model (LLM) agent systems, enabling reusable task packaging, public distribution, and community-driven capability sharing. However, despite their rapid growth, the functionality, ecosystem structure, and security risks of public skill registries remain underexplored. In this paper, we present an empirical study of ClawHub, a large public registry of agent skills. We build and normalize a dataset of 26,502 skills, and conduct a […]

Ver mais

Like 0

Liked Liked

technocracy

Oldest octopus fossil found to not be an octopus

digitado ⋅ 10 de April de 2026

Pohlsepia mazonensis, a visually underwhelming fossil from Illinois, fundamentally broke our understanding of cephalopod evolution. Described in 2000 and hailed as the oldest known octopus in the fossil record, the specimen dated back to the late Carboniferous period, roughly 311 to 306 million years ago. Pohlsepia was an outlier—all other fossil records strongly suggested that crown coleoids, the group containing octopuses, squid, and cuttlefish, diverged much later, during the Jurassic. To solve this puzzle, Thomas Clements, a paleontologist […]

Ver mais

Like 0

Liked Liked

technocracy

Integrating Distribution Matching into Semi-Supervised Contrastive Learning for Labeled and Unlabeled Data

digitado ⋅ 8 de January de 2026

The advancement of deep learning has greatly improved supervised image classification. However, labeling data is costly, prompting research into unsupervised learning methods such as contrastive learning. In real-world scenarios, fully unlabeled datasets are rare, making semi-supervised learning (SSL) highly relevant in scenarios where a small amount of labeled data coexists with a large volume of unlabeled data. A well-known semi-supervised contrastive learning approach involves assigning pseudo-labels to unlabeled data. This study aims to enhance pseudo-label-based SSL by incorporating […]

Ver mais

Like 0

Liked Liked

technocracy

Efficient Multi-Cohort Inference for Long-Term Effects and Lifetime Value in A/B Testing with User Learning

digitado ⋅ 22 de April de 2026

In streaming platforms churn is extremely costly, yet A/B tests are typically evaluated using outcomes observed within a limited experimental horizon. Even when both short- and predicted long-term engagement metrics are considered, they may fail to capture how a treatment affects users’ retention. Consequently, an intervention may appear beneficial in the short term and neutral in the long term while still generating lower total value than the control due to users churn. To address this limitation, we introduce […]

Ver mais

Like 0

Liked Liked

technocracy

National Software Testing Conference Announces Industry-Leading Speaker Line-Up For 2026

digitado ⋅ 11 de March de 2026

London, UK The National Software Testing Conference (NSTC) 2026, which will be held at the Grand Connaught Rooms in London from July 14–15, 2026,have announced their speaker line-up for this year. NSTC, one of the top conferences for professionals in software testing, quality assurance, and quality engineering in the UK, is back with an agenda that looks to the future and examines the role of artificial intelligence, accessibility, security, and contemporary testing techniques. Senior decision-makers and practitioners from […]

Ver mais

Like 0

Liked Liked

technocracy

General Bayesian Policy Learning

digitado ⋅ 27 de February de 2026

This study proposes the General Bayes framework for policy learning. We consider decision problems in which a decision-maker chooses an action from an action set to maximize its expected welfare. Typical examples include treatment choice and portfolio selection. In such problems, the statistical target is a decision rule, and the prediction of each outcome $Y(a)$ is not necessarily of primary interest. We formulate this policy learning problem by loss-based Bayesian updating. Our main technical device is a squared-loss […]

Ver mais

Like 0

Liked Liked

technocracy

This Entrepreneur Just Built the Most Structurally Honest Review Platform in the World

digitado ⋅ 6 de May de 2026

About 30% of all online reviews are considered fake. On some major platforms, up to 47% of reviews have been flagged as suspicious. AI-generated fake reviews have been growing 80% month-over-month since mid-2023, according to The Transparency Company. Google blocked or removed over 240 million policy-violating reviews in 2024, up from 170 million the year before. And in December 2025, the U.S. Federal Trade Commission issued its first enforcement action under its new Consumer Review Rule, warning ten […]

Ver mais

Like 0

Liked Liked

technocracy

Ding-dong! The Exploration Upper Stage is dead

digitado ⋅ 7 de March de 2026

In his 1961 novel The Winter of Our Discontent, John Steinbeck wrote of loss, “It’s so much darker when a light goes out than it would have been if it had never shone.” The death of NASA’s Exploration Upper Stage today represents the inverse of that sentiment. The world of spaceflight is so much brighter now that its light has gone out. The rocket’s death came via a seemingly pedestrian notice posted on a government procurement website: “NASA/MSFC […]

Ver mais

Like 0

Liked Liked

technocracy

FedSEA: Achieving Benefit of Parallelization in Federated Online Learning

digitado ⋅ 21 de April de 2026

Online federated learning (OFL) has emerged as a popular framework for decentralized decision-making over continuous data streams without compromising client privacy. However, the adversary model assumed in standard OFL typically precludes any potential benefits of parallelization. Further, it fails to adequately capture the different sources of statistical variation in OFL problems. In this paper, we extend the OFL paradigm by integrating a stochastically extended adversary (SEA). Under this framework, the loss function remains fixed across clients over time. […]

Ver mais

Like 0

Liked Liked