digitado – Page 36

Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple

digitado ⋅ 26 January 2026

arXiv:2601.16294v1 Announce Type: new Abstract: General Matrix Multiplication (GEMM) is the cornerstone of Deep Learning and HPC workloads; accordingly, academia and industry have heavily optimized this kernel. Modern platforms with matrix multiplication accelerators exhibit high FLOP/Byte machine balance, which makes implementing optimal matrix multiplication challenging. On modern CPU platforms with matrix engines, state-of-the-art vendor libraries tune input tensor layouts, parallelization schemes, and cache blocking to minimize data movement across the memory hierarchy and maximize throughput. However, the best […]

technocracy

Investigating the Multilingual Calibration Effects of Language Model Instruction-Tuning

digitado ⋅ 6 January 2026

arXiv:2601.01362v1 Announce Type: cross Abstract: Ensuring that deep learning models are well-calibrated in terms of their predictive uncertainty is essential in maintaining their trustworthiness and reliability, yet despite increasing advances in foundation model research, the relationship between such large language models (LLMs) and their calibration remains an open area of research. In this work, we look at a critical gap in the calibration of LLMs within multilingual settings, in an attempt to better understand how the data scarcity […]

technocracy

Non-intrusive Learning of Physics-Informed Spatio-temporal Surrogate for Accelerating Design

digitado ⋅ 15 April 2026

Most practical engineering design problems involve nonlinear spatio-temporal dynamical systems. Multi-physics simulations are often performed to capture the fine spatio-temporal scales which govern the evolution of these systems. However, these simulations are often high-fidelity in nature, and can be computationally very expensive. Hence, generating data from these expensive simulations becomes a bottleneck in an end-to-end engineering design process. Spatio-temporal surrogate modeling of these dynamical systems has been a popular data-driven solution to tackle this computational bottleneck. This is […]

technocracy

From Human Interfaces to Agent Interfaces: Rethinking Software Design in the Age of AI-Native Systems

digitado ⋅ 24 March 2026

arXiv:2603.20300v1 Announce Type: new Abstract: Software systems have traditionally been designed for human interaction, emphasizing graphical user interfaces, usability, and cognitive alignment with end users. However, recent advances in large language model (LLM)-based agents are changing the primary consumers of software systems. Increasingly, software is no longer only used by humans, but also invoked autonomously by AI agents through structured interfaces. In this paper, we argue that software engineering is undergoing a paradigm shift from human-oriented interfaces to […]

technocracy

Is my GRPO LLM training on my ETL-Doctor-Pipeline-Env working?

digitado ⋅ 22 April 2026

https://preview.redd.it/hg6sw1ps6qwg1.png?width=897&format=png&auto=webp&s=ffbc86307eb7f8ab88a7fbb132cd69c20fe62c33 I am training Qwen3-0.6B on an RL environment for LLMs that I built myself, and I'm feeling lost and confused. Here is the HF Space link: https://huggingface.co/spaces/Atharva1232/etl_pipeline_doctor and here's the GitHub repo: https://github.com/Its-Atharva-Gupta/EPL-Pipeline-Doctor-Env I used Claude Code to build the environment, since this is for a hackathon and the time limit is really short. Is my training going well, or should I refactor something? submitted by /u/Full_Promotion4522

technocracy

Conflict-Aware Multimodal Fusion for Ambivalence and Hesitancy Recognition

digitado ⋅ 18 March 2026

arXiv:2603.15818v1 Announce Type: new Abstract: Ambivalence and hesitancy (A/H) are subtle affective states where a person shows conflicting signals through different channels, saying one thing while their face or voice tells another story. Recognising these states automatically is valuable in clinical settings, but it is hard for machines because the key evidence lives in the disagreements between what is said, how it sounds, and what the face shows. We present ConflictAwareAH, a multimodal framework built for this […]

technocracy

Non-existence of Information-Geometric Fermat Structures: Violation of Dual Lattice Consistency in Statistical Manifolds with $L^n$ Structure

digitado ⋅ 11 February 2026

arXiv:2602.09028v1 Announce Type: new Abstract: This paper reformulates Fermat’s Last Theorem as an embedding problem of information-geometric structures. We reinterpret the Fermat equation as an $n$-th moment constraint, constructing a statistical manifold $\mathcal{M}_n$ of generalized normal distributions via the Maximum Entropy Principle. By Chentsov’s Theorem, the natural metric is the Fisher information metric ($L^2$); however, the global structure is governed by the $L^n$ moment constraint. This reveals a discrepancy between the local quadratic metric and the global $L^n$ […]

technocracy

Pierre Teilhard de Chardin: My Universe

digitado ⋅ 15 January 2025

Written to clarify ideas that bewildered even his closest friends, Teilhard de Chardin explains his lifelong struggle: reconciling love of God with love of the world. From childhood, he sought the Absolute—first in metal, then matter, finally the cosmos itself. His revelation: Christ doesn’t compete with the universe but completes it, making every human endeavor a form of worship toward one divine convergence.

technocracy

Enhance and Reuse: A Dual-Mechanism Approach to Boost Deep Forest for Label Distribution Learning

digitado ⋅ 6 February 2026

Label distribution learning (LDL) requires the learner to predict the degree of correlation between each sample and each label. To achieve this, a crucial task during learning is to leverage the correlation among labels. Deep Forest (DF) is a deep learning framework based on tree ensembles, whose training phase does not rely on backpropagation. DF performs in-model feature transform using the prediction of each layer and achieves competitive performance on many tasks. However, its exploration in the field […]

technocracy

The brain power behind sustainable AI

digitado ⋅ 24 October 2025

How can you use science to build a better gingerbread house? That was something Miranda Schwacke spent a lot of time thinking about. The MIT graduate student in the Department of Materials Science and Engineering (DMSE) is part of Kitchen Matters, a group of grad students who use food and kitchen tools to explain scientific concepts through short videos and outreach events. Past topics included why chocolate “seizes,” or becomes difficult to work with when melting (spoiler: water gets in), and […]
