digitado – Page 403

Gradient flow in parameter space is equivalent to linear interpolation in output space

digitado ⋅ 14 de January de 2026

arXiv:2408.01517v3 Announce Type: replace-cross Abstract: We prove that the standard gradient flow in parameter space that underlies many training algorithms in deep learning can be continuously deformed into an adapted gradient flow which yields (constrained) Euclidean gradient flow in output space. Moreover, for the $L^{2}$ loss, if the Jacobian of the outputs with respect to the parameters is full rank (for fixed training data), then the time variable can be reparametrized so that the resulting flow is simply […]

Ver mais

Like 0

Liked Liked

technocracy

Is Prompt Selection Necessary for Task-Free Online Continual Learning?

digitado ⋅ 6 de April de 2026

Task-free online continual learning has recently emerged as a realistic paradigm for addressing continual learning in dynamic, real-world environments, where data arrive in a non-stationary stream without clear task boundaries and can only be observed once. To consider such challenging scenarios, many recent approaches have employed prompt selection, an adaptive strategy that selects prompts from a pool based on input signals. However, we observe that such selection strategies often fail to select appropriate prompts, yielding suboptimal results despite […]

Ver mais

Like 0

Liked Liked

technocracy

Analytic Parametric Multi-Step Solution of All Area and Moments Integrals of General Green’s Theorem for Arbitrary Ellipse Region, Part 1: Central Sector

digitado ⋅ 20 de April de 2026

In this paper, all the integrals and their solutions are given for the analytical calculation of all six area and moments values of the arbitrary ellipse region given in trigonometric parametric form, based on the general moment form of Green’s theorem curve integral obtained from the discrete and differential vector product methods. The actual area and moments values of the arbitrary ellipse regions are then calculated by application of Boolean algebra on the ellipse parts and their remaining […]

Ver mais

Like 0

Liked Liked

technocracy

Trust Region Masking for Long-Horizon LLM Reinforcement Learning

digitado ⋅ 2 de March de 2026

arXiv:2512.23075v4 Announce Type: replace-cross Abstract: Policy gradient methods for Large Language Models optimize a policy $pi_theta$ via a surrogate objective computed from samples of a rollout policy $pi_{text{roll}}$. However, modern LLM-RL pipelines suffer from unavoidable implementation divergences — backend discrepancies, Mixture-of-Experts routing discontinuities, and distributed training staleness — causing off-policy mismatch ($pi_{text{roll}} neq pi_theta$) and approximation errors between the surrogate and the true objective. We demonstrate that classical trust region bounds on this error scale as $O(T^2)$ with […]

Ver mais

Like 0

Liked Liked

technocracy

PCEval: A Benchmark for Evaluating Physical Computing Capabilities of Large Language Models

digitado ⋅ 7 de January de 2026

arXiv:2601.02404v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains, including software development, education, and technical assistance. Among these, software development is one of the key areas where LLMs are increasingly adopted. However, when hardware constraints are considered-for instance, in physical computing, where software must interact with and control physical hardware -their effectiveness has not been fully explored. To address this gap, we introduce textsc{PCEval} (Physical Computing Evaluation), the first benchmark in […]

Ver mais

Like 0

Liked Liked

technocracy

Multi-Head Attention based interaction-aware architecture for Bangla Handwritten Character Recognition: Introducing a Primary Dataset

digitado ⋅ 15 de April de 2026

arXiv:2604.09717v1 Announce Type: new Abstract: Character recognition is the fundamental part of an optical character recognition (OCR) system. Word recognition, sentence transcription, document digitization, and language processing are some of the higher-order activities that can be done accurately through character recognition. Nonetheless, recognizing handwritten Bangla characters is not an easy task because they are written in different styles with inconsistent stroke patterns and a high degree of visual character resemblance. The datasets available are usually limited in intra-class […]

Ver mais

Like 0

Liked Liked

technocracy

De confidente a vendedor: cómo la publicidad amenaza la confianza en ChatGPT

digitado ⋅ 23 de January de 2026

Hay un momento en la historia de cada tecnología en el deja de ser un refugio inmaculado para convertirse en un espacio comercial. La radio lo hizo cuando surgieron los anuncios que financiaron programas, internet lo hizo cuando las páginas se llenaron de banners, las redes sociales lo hicieron cuando los muros de amigos se transformaron en malditos escaparates. Hoy, estamos a punto de ver lo mismo con la primera generación de la inteligencia artificial conversacional, un producto […]

Ver mais

Like 0

Liked Liked

technocracy

Robotic Assembly Using Deep Reinforcement Learning

digitado ⋅ 21 de October de 2020

Introduction Disclaimer: This article is a cross post from Pytorch Medium Blog Post. One of the most exciting advancements, that has pushed the frontier of the Artificial Intelligence (AI) in recent years, is Deep Reinforcement Learning (DRL). DRL belongs to the family of machine learning algorithms. It assumes that intelligent machines can learn from their actions similar to the way humans learn from experience. Over the recent years we could witness some impressive real-world applications of DRL. The […]

Ver mais

Like 0

Liked Liked

technocracy

Yann LeCun’s $1B bet against LLMs

digitado ⋅ 11 de March de 2026

Read Online | Sign Up | Advertise Good morning, {{ first_name | AI enthusiasts }}. Few people in AI have been louder about LLMs being a dead end than Yann LeCun. Even fewer have a Turing Award and a billion dollars to do something about it. His new Advanced Machine Intelligence just launched with over $1B in funding to build what he believes LLMs never can: AI that actually understands the real world. In today’s AI rundown: LeCun’s […]

Ver mais

Like 0

Liked Liked

technocracy

Seasonal Switch 2 sales show significant slowing as annual cycle sunsets

digitado ⋅ 8 de January de 2026

Nintendo’s Switch 2 was an unmitigated market success for Nintendo following its launch last June, selling a record-setting 3.5 million units worldwide in its first four days and reaching over 10 million shipments in just under four months. But a new report from The Game Business suggests that frenzied initial sales pace may have slowed significantly in many markets during the system’s crucial first holiday season. The report suggests that US Switch 2 sales were down about 35 […]

Ver mais

Like 0

Liked Liked