January 2026

Dissecting Multimodal In-Context Learning: Modality Asymmetries and Circuit Dynamics in modern Transformers

digitado ⋅ 28 de January de 2026

Transformer-based multimodal large language models often exhibit in-context learning (ICL) abilities. Motivated by this phenomenon, we ask: how do transformers learn to associate information across modalities from in-context examples? We investigate this question through controlled experiments on small transformers trained on synthetic classification tasks, enabling precise manipulation of data statistics and model architecture. We begin by revisiting core principles of unimodal ICL in modern transformers. While several prior findings replicate, we find that Rotary Position Embeddings (RoPE) increases […]

Ver mais

Like 0

Liked Liked

technocracy

Meta blocks links to ICE List across Facebook, Instagram, and Threads

digitado ⋅ 28 de January de 2026

Meta has started blocking its users from sharing links to ICE List, a website that has compiled the names of what it claims are Department of Homeland Security employees, a project the creators say is designed to hold those employees accountable. Dominick Skinner, the creator of ICE List, tells WIRED that links to the website have been shared without issue on Meta’s platforms for more than six months. “I think it’s no surprise that a company run by […]

Ver mais

Like 0

Liked Liked

technocracy

Report: China approves import of high-end Nvidia AI chips after weeks of uncertainty

digitado ⋅ 28 de January de 2026

On Wednesday, China approved imports of Nvidia’s H200 artificial intelligence chips for three of its largest technology companies, Reuters reported. ByteDance, Alibaba, and Tencent received approval to purchase more than 400,000 H200 chips in total, marking a shift in Beijing’s stance after weeks of holding up shipments despite US export clearance. The move follows Beijing’s temporary halt to H200 shipments earlier this month after Washington cleared exports on January 13. Chinese customs authorities had told agents that the […]

Ver mais

Like 0

Liked Liked

technocracy

South Carolina tops Texas measles outbreak record—with no end in sight

digitado ⋅ 28 de January de 2026

The explosive measles outbreak in South Carolina has now reached 789 cases, breaking Texas’s outbreak record last year of 762 cases, which at the time was the largest outbreak in the US since measles was declared eliminated from the US in 2000. The country is at grave risk of losing its elimination status in the coming months due to continuous spread. With Texas’ outbreak last year—which spanned January to August and spread to additional states—the US saw the […]

Ver mais

Like 0

Liked Liked

technocracy

Active Learning for Decision Trees with Provable Guarantees

digitado ⋅ 28 de January de 2026

This paper advances the theoretical understanding of active learning label complexity for decision trees as binary classifiers. We make two main contributions. First, we provide the first analysis of the disagreement coefficient for decision trees-a key parameter governing active learning label complexity. Our analysis holds under two natural assumptions required for achieving polylogarithmic label complexity, (i) each root-to-leaf path queries distinct feature dimensions, and (ii) the input data has a regular, grid-like structure. We show these assumptions are […]

Ver mais

Like 0

Liked Liked

technocracy

When More Data Doesn’t Help: Limits of Adaptation in Multitask Learning

digitado ⋅ 28 de January de 2026

Multitask learning and related frameworks have achieved tremendous success in modern applications. In multitask learning problem, we are given a set of heterogeneous datasets collected from related source tasks and hope to enhance the performance above what we could hope to achieve by solving each of them individually. The recent work of arXiv:2006.15785 has showed that, without access to distributional information, no algorithm based on aggregating samples alone can guarantee optimal risk as long as the sample size […]

Ver mais

Like 0

Liked Liked

technocracy

Cross-Country Learning for National Infectious Disease Forecasting Using European Data

digitado ⋅ 28 de January de 2026

Accurate forecasting of infectious disease incidence is critical for public health planning and timely intervention. While most data-driven forecasting approaches rely primarily on historical data from a single country, such data are often limited in length and variability, restricting the performance of machine learning (ML) models. In this work, we investigate a cross-country learning approach for infectious disease forecasting, in which a single model is trained on time series data from multiple countries and evaluated on a country […]

Ver mais

Like 0

Liked Liked

technocracy

Leveraging Second-Order Curvature for Efficient Learned Image Compression: Theory and Empirical Evidence

digitado ⋅ 28 de January de 2026

Training learned image compression (LIC) models entails navigating a challenging optimization landscape defined by the fundamental trade-off between rate and distortion. Standard first-order optimizers, such as SGD and Adam, struggle with emph{gradient conflicts} arising from competing objectives, leading to slow convergence and suboptimal rate-distortion performance. In this work, we demonstrate that a simple utilization of a second-order quasi-Newton optimizer, textbf{SOAP}, dramatically improves both training efficiency and final performance across diverse LICs. Our theoretical and empirical analyses reveal that […]

Ver mais

Like 0

Liked Liked

technocracy

Leveraging Second-Order Curvature for Efficient Learned Image Compression: Theory and Empirical Evidence

digitado ⋅ 28 de January de 2026

Ver mais

Like 0

Liked Liked

technocracy

Ryzen 9850X3D review: AMD’s bragging-rights gaming CPU gets more to brag about

digitado ⋅ 28 de January de 2026

AMD has released three distinct generations of its 3D V-Cache technology, which initially appeared in the Ryzen 7 5800X3D in 2022. The kernel of the idea has remained the same throughout AMD’s efforts: take an existing desktop processor design and graft 64MB of additional L3 cache onto it. This approach disproportionately helps apps that benefit from more cache, particularly games, and the size of the boost that 3D V-Cache gives to game performance has always been enough to […]

Ver mais

Like 0

Liked Liked