Train Less, Infer Faster: Efficient Model Finetuning and Compression via Structured Sparsity
arXiv:2602.09169v1

Abstract: Fully finetuning foundation language models (LMs) with billions of parameters is often impractical due to the high computational cost, memory requirements, and risk of overfitting. Although methods such as low-rank adapters address these challenges by adding small trainable modules to the frozen LM, they increase memory usage and do not reduce inference latency. We uncover an intriguing phenomenon: sparsifying specific rows and columns of the model's weight matrices enables efficient task adaptation without tuning the weights. […]
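To make the core idea concrete, here is a minimal PyTorch sketch of structured row/column sparsification applied to a frozen linear layer; this is an illustration under stated assumptions, not the paper's actual method (the excerpt does not describe how masks are selected or learned), and the class and mask names are hypothetical.

```python
import torch
import torch.nn as nn

class RowColumnSparsifiedLinear(nn.Module):
    """Frozen linear layer adapted only by binary row/column masks.

    Sketch of the idea in the abstract: pretrained weights stay frozen;
    the task-specific state is just which rows/columns are kept. Because
    the sparsity is structured, masked rows/columns can later be removed
    physically, shrinking the layer and cutting inference latency
    (unlike adapters, which add parameters and compute).
    """

    def __init__(self, base: nn.Linear, row_mask: torch.Tensor, col_mask: torch.Tensor):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)  # weights are never tuned
        # Binary masks over output rows and input columns of the weight matrix.
        self.register_buffer("row_mask", row_mask.float())  # shape: (out_features,)
        self.register_buffer("col_mask", col_mask.float())  # shape: (in_features,)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Zero out whole rows and columns of the frozen weight matrix.
        w = self.base.weight * self.row_mask[:, None] * self.col_mask[None, :]
        b = self.base.bias * self.row_mask if self.base.bias is not None else None
        return nn.functional.linear(x, w, b)


# Usage: adapt a frozen 512x512 layer by keeping ~75% of rows and columns.
layer = nn.Linear(512, 512)
row_mask = torch.rand(512) < 0.75
col_mask = torch.rand(512) < 0.75
sparse_layer = RowColumnSparsifiedLinear(layer, row_mask, col_mask)
y = sparse_layer(torch.randn(4, 512))
```

The masks here are random purely for demonstration; in practice they would be chosen per task by some selection or learning procedure, which is the part the truncated abstract leaves unspecified.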