digitado – Page 280

Thinking with Deltas: Incentivizing Reinforcement Learning via Differential Visual Reasoning Policy

digitado ⋅ 11 de January de 2026

Reinforcement Learning with Verifiable Rewards (RLVR) has significantly advanced reasoning capabilities in Large Language Models. However, adapting RLVR to multimodal domains suffers from a critical textit{perception-reasoning decoupling}. Existing paradigms, driven by text-centric outcome rewards, reasoning in language medium, inadvertently encourage models to bypass visual perception. We empirically validate this through blind experiments: state-of-the-art policies maintain or surprisingly improve performance even when visual inputs are entirely removed. This reveals that these models degenerate into textit{blind reasoners}, exploiting linguistic priors […]

Ver mais

Like 0

Liked Liked

technocracy

RECAP: A Resource-Efficient Method for Adversarial Prompting in Large Language Models

digitado ⋅ 23 de January de 2026

arXiv:2601.15331v1 Announce Type: new Abstract: The deployment of large language models (LLMs) has raised security concerns due to their susceptibility to producing harmful or policy-violating outputs when exposed to adversarial prompts. While alignment and guardrails mitigate common misuse, they remain vulnerable to automated jailbreaking methods such as GCG, PEZ, and GBDA, which generate adversarial suffixes via training and gradient-based search. Although effective, these methods particularly GCG are computationally expensive, limiting their practicality for organisations with constrained resources. This […]

Ver mais

Like 0

Liked Liked

technocracy

Inductive Convolution Nuclear Norm Minimization for Tensor Completion with Arbitrary Sampling

digitado ⋅ 21 de April de 2026

arXiv:2604.17001v1 Announce Type: new Abstract: The recently established Convolution Nuclear Norm Minimization (CNNM) addresses the problem of textit{tensor completion with arbitrary sampling} (TCAS), which involves restoring a tensor from a subset of its entries sampled in an arbitrary manner. Despite its promising performance, the optimization procedure of CNNM needs performing Singular Value Decomposition (SVD) multiple times, which is computationally expensive and hard to parallelize. To address the issue, we reformulate the optimization objective of CNNM from the perspective […]

Ver mais

Like 0

Liked Liked

technocracy

Outliers Detection in PySpark #3 – K-means

digitado ⋅ 6 de August de 2019

In parts #1 and #2 of the “Outliers Detection in PySpark” series, I talked about Anomaly Detection, Outliers Detection and the interquartile range (boxplot) method. In this third and last part, I will talk about how one can use the popular K-means clustering algorithm to detect outliers. K-means K-means is one of the easiest and most popular unsupervised algorithms in Machine Learning for Clustering.

Ver mais

Like 0

Liked Liked

technocracy

Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization

digitado ⋅ 1 de April de 2026

arXiv:2603.28959v1 Announce Type: new Abstract: The exploration-exploitation trade-off is central to sequential decision-making and black-box optimization, yet how Large Language Models (LLMs) reason about and manage this trade-off remains poorly understood. Unlike Bayesian Optimization, where exploration and exploitation are explicitly encoded through acquisition functions, LLM-based optimization relies on implicit, prompt-based reasoning over historical evaluations, making search behavior difficult to analyze or control. In this work, we present a metric-level study of LLM-mediated search policy learning, studying how LLMs […]

Ver mais

Like 0

Liked Liked

technocracy

NeST: Neuron Selective Tuning for LLM Safety

digitado ⋅ 20 de February de 2026

arXiv:2602.16835v1 Announce Type: new Abstract: Safety alignment is essential for the responsible deployment of large language models (LLMs). Yet, existing approaches often rely on heavyweight fine-tuning that is costly to update, audit, and maintain across model families. Full fine-tuning incurs substantial computational and storage overhead, while parameter-efficient methods such as LoRA trade efficiency for inconsistent safety gains and sensitivity to design choices. Safety intervention mechanisms such as circuit breakers reduce unsafe outputs without modifying model weights, but do […]

Ver mais

Like 0

Liked Liked

technocracy

RL for modeling rodent behavior?

digitado ⋅ 2 de February de 2026

I’ve seen some pretty cool work using Q learning and HMMs to model rat behavior in some pretty complex behavioral paradigms, <e.g learning a contrast gradient with psychometric function etc…) but for very classical associative learning, are there any interesting approaches that one might use? What properties/parameters of conditioned learning, e.g. beyond learning rate might be interesting to try to pull out by fitting RLs? submitted by /u/traydblockzplz [link] [comments]

Ver mais

Like 0

Liked Liked

technocracy

Exposing biases, moods, personalities, and abstract concepts hidden in large language models

digitado ⋅ 23 de April de 2026

By now, ChatGPT, Claude, and other large language models have accumulated so much human knowledge that they’re far from simple answer-generators; they can also express abstract concepts, such as certain tones, personalities, biases, and moods. However, it’s not obvious exactly how these models represent abstract concepts to begin with from the knowledge they contain. Now a team from MIT and the University of California San Diego has developed a way to test whether a large language model (LLM) […]

Ver mais

Like 0

Liked Liked

technocracy

The Toxic Status Game Inside Indie Hacking

digitado ⋅ 9 de April de 2026

The indie hacker hierarchy is bullshit, and I fell for it anyway. I bought another domain this week. I don’t actually need it. I already have a business that works, one that pays me every month, serves a real audience, and has been running for almost a decade. Wtf. But I bought it anyway because brainstorming ideas with Claude and browsing Namecheap at night kinda feels like building. I tweeted about this recently and the gist was that […]

Ver mais

Like 0

Liked Liked

technocracy

Trapped by simplicity: When Transformers fail to learn from noisy features

digitado ⋅ 9 de February de 2026

Noise is ubiquitous in data used to train large language models, but it is not well understood whether these models are able to correctly generalize to inputs generated without noise. Here, we study noise-robust learning: are transformers trained on data with noisy features able to find a target function that correctly predicts labels for noiseless features? We show that transformers succeed at noise-robust learning for a selection of $k$-sparse parity and majority functions, compared to LSTMs which fail […]

Ver mais

Like 0

Liked Liked