January 2026

Part3: Guide to Hugging-face AutoModels** for Audio

digitado ⋅ 14 de January de 2026

In series of AutoModel** for We have discussed for Text based NLP models in part 1 and Vision based Models in Part2 Now we will discuss the Audio Based Models in this part We will cover: How Hugging Face represents audio tasks Core AutoModelFor** classes for audio Common architectures behind them Practical examples (speech recognition, audio classification, text-to-speech) Tips for choosing the right class Audio Tasks in Hugging Face Audio models operate on waveforms or audio features instead of tokens. Hugging Face standardizes this workflow using: Datasets: […]

Ver mais

Like 0

Liked Liked

technocracy

Deep Learning-based Binary Analysis for Vulnerability Detection in x86-64 Machine Code

digitado ⋅ 14 de January de 2026

While much of the current research in deep learning-based vulnerability detection relies on disassembled binaries, this paper explores the feasibility of extracting features directly from raw x86-64 machine code. Although assembly language is more interpretable for humans, it requires more complex models to capture token-level context. In contrast, machine code may enable more efficient, lightweight models and preserve all information that might be lost in disassembly. This paper approaches the task of vulnerability detection through an exploratory study […]

Ver mais

Like 0

Liked Liked

technocracy

Discrete Solution Operator Learning for Geometry-Dependent PDEs

digitado ⋅ 14 de January de 2026

Neural operator learning accelerates PDE solution by approximating operators as mappings between continuous function spaces. Yet in many engineering settings, varying geometry induces discrete structural changes, including topological changes, abrupt changes in boundary conditions or boundary types, and changes in the computational domain, which break the smooth-variation premise. Here we introduce Discrete Solution Operator Learning (DiSOL), a complementary paradigm that learns discrete solution procedures rather than continuous function-space operators. DiSOL factorizes the solver into learnable stages that mirror […]

Ver mais

Like 0

Liked Liked

technocracy

Discrete Solution Operator Learning for Geometry-Dependent PDEs

digitado ⋅ 14 de January de 2026

Neural operator learning accelerates PDE solution by approximating operators as mappings between continuous function spaces. Yet in many engineering settings, varying geometry induces discrete structural changes, including topological changes, abrupt changes in boundary conditions or boundary types, and changes in the effective computational domain, which break the smooth-variation premise. Here we introduce Discrete Solution Operator Learning (DiSOL), a complementary paradigm that learns discrete solution procedures rather than continuous function-space operators. DiSOL factorizes the solver into learnable stages that […]

Ver mais

Like 0

Liked Liked

technocracy

A Machine Learning Approach Towards Runtime Optimisation of Matrix Multiplication

digitado ⋅ 14 de January de 2026

The GEneral Matrix Multiplication (GEMM) is one of the essential algorithms in scientific computing. Single-thread GEMM implementations are well-optimised with techniques like blocking and autotuning. However, due to the complexity of modern multi-core shared memory systems, it is challenging to determine the number of threads that minimises the multi-thread GEMM runtime. We present a proof-of-concept approach to building an Architecture and Data-Structure Aware Linear Algebra (ADSALA) software library that uses machine learning to optimise the runtime performance of […]

Ver mais

Like 0

Liked Liked

technocracy

Comparative Assessment of Concrete Compressive Strength Prediction at Industry Scale Using Embedding-based Neural Networks, Transformers, and Traditional Machine Learning Approaches

digitado ⋅ 14 de January de 2026

Concrete is the most widely used construction material worldwide; however, reliable prediction of compressive strength remains challenging due to material heterogeneity, variable mix proportions, and sensitivity to field and environmental conditions. Recent advances in artificial intelligence enable data-driven modeling frameworks capable of supporting automated decision-making in construction quality control. This study leverages an industry-scale dataset consisting of approximately 70,000 compressive strength test records to evaluate and compare multiple predictive approaches, including linear regression, decision trees, random forests, transformer-based […]

Ver mais

Like 0

Liked Liked

technocracy

SRT: Accelerating Reinforcement Learning via Speculative Rollout with Tree-Structured Cache

digitado ⋅ 14 de January de 2026

We present Speculative Rollout with Tree-Structured Cache (SRT), a simple, model-free approach to accelerate on-policy reinforcement learning (RL) for language models without sacrificing distributional correctness. SRT exploits the empirical similarity of rollouts for the same prompt across training steps by storing previously generated continuations in a per-prompt tree-structured cache. During generation, the current policy uses this tree as the draft model for performing speculative decoding. To keep the cache fresh and improve draft model quality, SRT updates trees […]

Ver mais

Like 0

Liked Liked

technocracy

Lean Clients, Full Accuracy: Hybrid Zeroth- and First-Order Split Federated Learning

digitado ⋅ 14 de January de 2026

Split Federated Learning (SFL) enables collaborative training between resource-constrained edge devices and a compute-rich server. Communication overhead is a central issue in SFL and can be mitigated with auxiliary networks. Yet, the fundamental client-side computation challenge remains, as back-propagation requires substantial memory and computation costs, severely limiting the scale of models that edge devices can support. To enable more resource-efficient client computation and reduce the client-server communication, we propose HERON-SFL, a novel hybrid optimization framework that integrates zeroth-order […]

Ver mais

Like 0

Liked Liked

technocracy

Train my reaction time and other things.

digitado ⋅ 14 de January de 2026

If i were to zap myself everytime i got under 190ms reaction time and kept lowering the threshold and made a program do the zaping would i increase my reaction time. if so i would also like to do that with data processing so showing a certain amount of numbers on a screen for a quarter second and trying to memorize all of the numbers increasing the amount of number gradually and zapping myself for every wrong number […]

Ver mais

Like 0

Liked Liked

technocracy

Curated papers on Physical AI – VLAs, world models, robot foundation models

digitado ⋅ 14 de January de 2026

Made a list tracking the Physical AI space — foundation models that control robots. Covers Vision-Language-Action (VLA) models like RT-2 and π₀, world models (DreamerV3, Genie 2, JEPA), diffusion policies, real-world deployment and latency problems, cross-embodiment transfer, scaling laws, and safety/alignment for robots. Organized by architecture → action representation → learning paradigm → deployment. GitHub in comments. Star if useful, PRs welcome. submitted by /u/kwk236 [link] [comments]

Ver mais

Like 0

Liked Liked