digitado – Page 219

Learning Rate Scaling across LoRA Ranks and Transfer to Full Finetuning

digitado ⋅ 9 de February de 2026

arXiv:2602.06204v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) is a standard tool for parameter-efficient finetuning of large models. While it induces a small memory footprint, its training dynamics can be surprisingly complex as they depend on several hyperparameters such as initialization, adapter rank, and learning rate. In particular, it is unclear how the optimal learning rate scales with adapter rank, which forces practitioners to re-tune the learning rate whenever the rank is changed. In this paper, we introduce […]

Ver mais

Like 0

Liked Liked

technocracy

Malliavin Calculus for Counterfactual Gradient Estimation in Adaptive Inverse Reinforcement Learning

digitado ⋅ 1 de April de 2026

Inverse reinforcement learning (IRL) recovers the loss function of a forward learner from its observed responses adaptive IRL aims to reconstruct the loss function of a forward learner by passively observing its gradients as it performs reinforcement learning (RL). This paper proposes a novel passive Langevin-based algorithm that achieves adaptive IRL. The key difficulty in adaptive IRL is that the required gradients in the passive algorithm are counterfactual, that is, they are conditioned on events of probability zero […]

Ver mais

Like 0

Liked Liked

technocracy

Can LLMs Perform Synthesis?

digitado ⋅ 24 de March de 2026

arXiv:2603.20264v1 Announce Type: new Abstract: How do LLMs compare with symbolic tools on program synthesis tasks? We investigate this question on several synthesis domains: LTL reactive synthesis, syntax-guided synthesis, distributed protocol synthesis, and recursive function synthesis. For each domain, we choose a state-of-the-art symbolic tool and compare it to an open-source, 32 billion parameter version of the Qwen LLM and the proprietary, frontier LLM GPT-5. We couple Qwen with a symbolic verifier and run it repeatedly until it […]

Ver mais

Like 0

Liked Liked

technocracy

Mystery GPS jammer in Iran becomes test for NASA satellites’ capabilities

digitado ⋅ 27 de May de 2026

NASA satellites designed to observe cyclone wind speeds and collapsing ice sheets have also proven capable of identifying the approximate locations of GPS jammers. That could help monitor high-risk areas for aircraft and ships navigating the growing prevalence of GPS interference worldwide. Two different NASA satellite systems showed how they could locate a known but mysterious GPS jammer within several kilometers of its position in Iran, according to an experiment by Sean Gorman, CEO and cofounder of the […]

Ver mais

Like 0

Liked Liked

technocracy

Build a solar flare detection system on SageMaker AI LSTM networks and ESA STIX data

digitado ⋅ 30 de March de 2026

The effective monitoring and characterization of solar flares demands sophisticated analysis of X-ray emissions across multiple energy spectrums. Machine learning-based anomaly detection serves as a powerful tool for identifying significant patterns that could indicate notable solar activity. Through the identification of distinct radiation signatures, key solar event characteristics can be detected, analyzed, and comprehensively understood. These detected patterns are essential for various applications, including space weather forecasting, solar physics investigations, and satellite operation planning. In recent years, solar […]

Ver mais

Like 0

Liked Liked

technocracy

Scalable voice agent design with Amazon Nova Sonic: multi-agent, tools, and session segmentation

digitado ⋅ 19 de May de 2026

Design patterns for scalable voice agents matter for organizations that need to deliver fast, natural, and reliable voice experiences. Many teams face challenges like high latency, managing real-time audio, and coordinating multiple agents in complex workflows. In this post, you’ll learn how to use Amazon Nova Sonic, Amazon Bedrock AgentCore, and Strands BidiAgent to build scalable, maintainable voice agents that handle these challenges efficiently, resulting in more responsive and intelligent customer interactions. We’ll explore three popular architectural patterns […]

Ver mais

Like 0

Liked Liked

technocracy

NSF renews support for MIT-led AI and physics institute, expanding a new model for discovery

digitado ⋅ 19 de June de 2026

The MIT-led Institute for Artificial Intelligence and Fundamental Interactions (IAIFI) has received renewed support from the National Science Foundation (NSF) for an additional five years, increasing annual funding from $4 million to $4.98 million. The renewal marks a new phase for IAIFI, which has spent its first five years building a research model and an interdisciplinary community around a central premise: that AI can open new ways of doing physics, while physics can help mold better AI systems. […]

Ver mais

Like 0

Liked Liked

technocracy

Agentic AI in Action -Part 10 -Beyond Frameworks: Building the Core Loop From First Principles

digitado ⋅ 24 de February de 2026

Beyond Frameworks: Building the Agent Core Loop From First Principles In the series of Agentic AI blogs, we have seen practical examples built using frameworks. We have also examined guardrails, governance, and why many impressive agent demos struggle when they encounter real operational complexity. At this point, it becomes useful to pause and strip everything back to first principles. Frameworks play an important role in accelerating the development of agentic systems and in turning ideas into working solutions […]

Ver mais

Like 0

Liked Liked

technocracy

Online Learning for Supervisory Switching Control

digitado ⋅ 16 de March de 2026

We study supervisory switching control for partially-observed linear dynamical systems. The objective is to identify and deploy the best controller for the unknown system by periodically selecting among a collection of $N$ candidate controllers, some of which may destabilize the underlying system. While classical estimator-based supervisory control guarantees asymptotic stability, it lacks quantitative finite-time performance bounds. Conversely, current non-asymptotic methods in both online learning and system identification require restrictive assumptions that are incompatible in a control setting, such […]

Ver mais

Like 0

Liked Liked

technocracy

Building Reliable Machine Learning Systems for Heart Disease Prediction

digitado ⋅ 4 de January de 2026

Image Source: https://www.technologynetworks.com/diagnostics/news/wealth-and-education-play-significant-role-in-heart-disease-risk-396976 Why ensemble methods generalize better than deep learning on clinical tabular data Heart disease continues to be the leading cause of death worldwide, responsible for millions of deaths every year. Despite advances in clinical diagnostics, early and accurate detection remains a persistent challenge. Traditional diagnostic procedures are often invasive, expensive, and heavily dependent on physician interpretation. This is where Machine Learning (ML) offers a compelling alternative. In this article, I present a comprehensive machine learning study […]

Ver mais

Like 0

Liked Liked