technocracy

When Gradient Optimization Is Not Enough: $dagger$ Dispersive and Anchoring Geometric Regularizer for Multimodal Learning

digitado ⋅ 29 de January de 2026

Multimodal learning aims to integrate complementary information from heterogeneous modalities, yet strong optimization alone does not guaranty well-structured representations. Even under carefully balanced training schemes, multimodal models often exhibit geometric pathologies, including intra-modal representation collapse and sample-level cross-modal inconsistency, which degrade both unimodal robustness and multimodal fusion. We identify representation geometry as a missing control axis in multimodal learning and propose regName, a lightweight geometry-aware regularization framework. regName enforces two complementary constraints on intermediate embeddings: an intra-modal dispersive […]

Ver mais

Like 0

Liked Liked

technocracy

Deep Two-Way Matrix Reordering for Relational Data Analysis

digitado ⋅ 17 de February de 2026

arXiv:2103.14203v5 Announce Type: replace Abstract: Matrix reordering is a task to permute the rows and columns of a given observed matrix such that the resulting reordered matrix shows meaningful or interpretable structural patterns. Most existing matrix reordering techniques share the common processes of extracting some feature representations from an observed matrix in a predefined manner, and applying matrix reordering based on it. However, in some practical cases, we do not always have prior knowledge about the structural pattern […]

Ver mais

Like 0

Liked Liked

technocracy

Exact and general decoupled solutions of the LMC Multitask Gaussian Process model

digitado ⋅ 18 de March de 2026

arXiv:2310.12032v3 Announce Type: replace-cross Abstract: The Linear Model of Co-regionalization (LMC) is a very general multitask gaussian process model for regression or classification. While its expressiveness and conceptual simplicity are appealing, naive implementations have cubic complexity in the product (number of datapoints $times$ number of tasks), making approximations mandatory for most applications. However, recent work has shown that in some settings the latent processes of the model can be decoupled, leading to a complexity that is only linear […]

Ver mais

Like 0

Liked Liked

technocracy

Federated Learning for the Design of Parametric Insurance Indices under Heterogeneous Renewable Production Losses

digitado ⋅ 17 de January de 2026

We propose a federated learning framework for the calibration of parametric insurance indices under heterogeneous renewable energy production losses. Producers locally model their losses using Tweedie generalized linear models and private data, while a common index is learned through federated optimization without sharing raw observations. The approach accommodates heterogeneity in variance and link functions and directly minimizes a global deviance objective in a distributed setting. We implement and compare FedAvg, FedProx and FedOpt, and benchmark them against an […]

Ver mais

Like 0

Liked Liked

technocracy

Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data

digitado ⋅ 10 de March de 2026

arXiv:2505.09496v3 Announce Type: replace Abstract: Offline reinforcement learning (RL) aims to find optimal policies in dynamic environments in order to maximize the expected total rewards by leveraging pre-collected data. Learning from heterogeneous data is one of the fundamental challenges in offline RL. Traditional methods focus on learning an optimal policy for all individuals with pre-collected data from a single episode or homogeneous batch episodes, and thus, may result in a suboptimal policy for a heterogeneous population. In this […]

Ver mais

Like 0

Liked Liked

technocracy

AWS and Hopkins Engineering announce groundbreaking database for AI/ML antibody design

digitado ⋅ 14 de April de 2026

In 1986 the US Food and Drug Administration issued its first approval for human use of a therapeutic antibody. Despite steady advances in methodology, genetic sequencing, and biomedical science, 40 years later the process of discovering and optimizing therapeutic antibodies often remains prohibitively expensive, in terms of both cost and time. Recent experiences with pandemic-style infectious-disease outbreaks lend an even greater urgency to the need to more quickly and efficiently identify and develop these antibodies. Artificial-intelligence- and machine-learning-guided […]

Ver mais

Like 0

Liked Liked

technocracy

DaggerFFT: A Distributed FFT Framework Using Task Scheduling in Julia

digitado ⋅ 21 de January de 2026

arXiv:2601.12209v1 Announce Type: new Abstract: The Fast Fourier Transform (FFT) is a fundamental numerical technique with widespread application in a range of scientific problems. As scientific simulations attempt to exploit exascale systems, there has been a growing demand for distributed FFT algorithms that can effectively utilize modern heterogeneous high-performance computing (HPC) systems. Conventional FFT algorithms commonly encounter performance bottlenecks, especially when run on heterogeneous platforms. Most distributed FFT approaches rely on static task distribution and require synchronization barriers, […]

Ver mais

Like 0

Liked Liked

technocracy

What Matters for Simulation to Online Reinforcement Learning on Real Robots

digitado ⋅ 25 de February de 2026

arXiv:2602.20220v1 Announce Type: new Abstract: We investigate what specific design choices enable successful online reinforcement learning (RL) on physical robots. Across 100 real-world training runs on three distinct robotic platforms, we systematically ablate algorithmic, systems, and experimental decisions that are typically left implicit in prior work. We find that some widely used defaults can be harmful, while a set of robust, readily adopted design choices within standard RL practice yield stable learning across tasks and hardware. These results […]

Ver mais

Like 0

Liked Liked

technocracy

Planning under Distribution Shifts with Causal POMDPs

digitado ⋅ 2 de March de 2026

arXiv:2602.23545v1 Announce Type: new Abstract: In the real world, planning is often challenged by distribution shifts. As such, a model of the environment obtained under one set of conditions may no longer remain valid as the distribution of states or the environment dynamics change, which in turn causes previously learned strategies to fail. In this work, we propose a theoretical framework for planning under partial observability using Partially Observable Markov Decision Processes (POMDPs) formulated using causal knowledge. By […]

Ver mais

Like 0

Liked Liked

technocracy

Domain-Skewed Federated Learning with Feature Decoupling and Calibration

digitado ⋅ 15 de March de 2026

Federated learning (FL) allows distributed clients to collaboratively train a global model in a privacy-preserving manner. However, one major challenge is domain skew, where clients’ data originating from diverse domains may hinder the aggregated global model from learning a consistent representation space, resulting in poor generalizable ability in multiple domains. In this paper, we argue that the domain skew is reflected in the domain-specific biased features of each client, causing the local model’s representations to collapse into a […]

Ver mais

Like 0

Liked Liked