digitado

DeepAFL: Deep Analytic Federated Learning

digitado ⋅ 28 February 2026

Federated Learning (FL) is a popular distributed learning paradigm for breaking down data silos. Traditional FL approaches largely rely on gradient-based updates and face significant issues with heterogeneity, scalability, convergence, and overhead. Recently, some analytic-learning-based work has attempted to handle these issues by replacing gradient-based updates with analytical (i.e., closed-form) solutions. Despite achieving superior invariance to data heterogeneity, these approaches are fundamentally limited by their single-layer linear model with a frozen pre-trained backbone. As a result, they can […]

technocracy

Solving and learning advective multiscale Darcian dynamics with the Neural Basis Method

digitado ⋅ 19 February 2026

Physics-governed models are increasingly paired with machine learning for accelerated predictions, yet most "physics-informed" formulations treat the governing equations as a penalty loss whose scale and meaning are set by heuristic balancing. This blurs operator structure, thereby confounding solution approximation error with governing-equation enforcement error and making the solving and learning progress hard to interpret and control. Here we introduce the Neural Basis Method, a projection-based formulation that couples a predefined, physics-conforming neural basis space with an operator-induced […]

technocracy

Why do only big ML labs dominate widely-used models despite many open-source pretrained models smaller labs could do RL on? [D]

digitado ⋅ 26 April 2026

I’m trying to understand why models from major labs (GPT, Claude, etc.) dominate real-world usage. You might say it’s due to the expensive pretraining compute budget, but there already exist many pretrained open-source models at the same scale (e.g., Kimi). Of course Kimi isn’t as good as Claude, but it’s the RL on top of the pretraining that makes Claude what it is, right? Given that Kimi, DeepSeek, etc. all have the expensive pretraining done, the RLHF on top […]

technocracy

The Sodium Battery and the Future

digitado ⋅ 9 February 2026

When analyzing technological transitions, we tend to fall into one of two extremes: assuming that the adoption of a new technology will be linear and predictable or, conversely, believing that disruptive innovation emerges out of nowhere and will redefine the existing ecosystem in an instant. The recent news that CATL and Changan are preparing the first mass-produced passenger vehicle with a sodium-ion battery for 2026 is being presented as a historic milestone in the sector of […]

technocracy

The Rise of Synthetic Labor

digitado ⋅ 16 February 2026

Author(s): Sam Okoye. Originally published on Towards AI. Abstract: Advanced economies are entering a sustained structural labor deficit driven by demographic decline, aging populations, and persistent sector-specific shortages. Traditional automation, including robotic process automation and narrow task-based systems, has delivered productivity improvements but has proven insufficient to address this gap at scale. This paper introduces synthetic labor, defined as agentic artificial intelligence systems that perform economically productive work with context awareness, memory, planning, tool use, coordination, and governance. […]

technocracy

Hybrid Model Predictive Control with Physics-Informed Neural Network for Satellite Attitude Control

digitado ⋅ 19 February 2026

arXiv:2602.15954v1. Abstract: Reliable spacecraft attitude control depends on accurate prediction of attitude dynamics, particularly when model-based strategies such as Model Predictive Control (MPC) are employed, where performance is limited by the quality of the internal system model. For spacecraft with complex dynamics, obtaining accurate physics-based models can be difficult, time-consuming, or computationally heavy. Learning-based system identification presents a compelling alternative; however, models trained exclusively on data frequently exhibit fragile stability properties and limited extrapolation capability. […]

technocracy

Hybrid Federated and Split Learning for Privacy Preserving Clinical Prediction and Treatment Optimization

digitado ⋅ 17 February 2026

Collaborative clinical decision support is often constrained by governance and privacy rules that prevent pooling patient-level records across institutions. We present a hybrid privacy-preserving framework that combines Federated Learning (FL) and Split Learning (SL) to support decision-oriented healthcare modeling without raw-data sharing. The approach keeps feature-extraction trunks on clients while hosting prediction heads on a coordinating server, enabling shared representation learning and exposing an explicit collaboration boundary where privacy controls can be applied. Rather than assuming distributed training […]

technocracy

Multi-Head Attention based interaction-aware architecture for Bangla Handwritten Character Recognition: Introducing a Primary Dataset

digitado ⋅ 15 April 2026

arXiv:2604.09717v1. Abstract: Character recognition is the fundamental part of an optical character recognition (OCR) system. Word recognition, sentence transcription, document digitization, and language processing are some of the higher-order activities that can be done accurately through character recognition. Nonetheless, recognizing handwritten Bangla characters is not an easy task because they are written in different styles with inconsistent stroke patterns and a high degree of visual character resemblance. The datasets available are usually limited in intra-class […]

technocracy

Stochastic approximation in non-markovian environments revisited

digitado ⋅ 24 March 2026

arXiv:2603.21091v1. Abstract: Based on some recent work of the author on stochastic approximation in non-markovian environments, the situation when the driving random process is non-ergodic in addition to being non-markovian is considered. Using this, we propose an analytic framework for understanding transformer-based learning, specifically the 'attention' mechanism, and continual learning, both of which depend on the entire past in principle.

technocracy

DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation

digitado ⋅ 2 March 2026

arXiv:2602.23438v1. Abstract: Graphic layouts serve as an important and engaging medium for visual communication across different channels. While recent layout generation models have demonstrated impressive capabilities, they frequently fail to align with nuanced human aesthetic judgment. Existing preference datasets and reward models trained on text-to-image generation do not generalize to layout evaluation, where the spatial arrangement of identical elements determines quality. To address this critical gap, we introduce DesignSense-10k, a large-scale dataset of 10,235 human-annotated […]
