digitado

Bandit Allocational Instability

digitado ⋅ February 10, 2026

arXiv:2602.07472v1 Announce Type: cross Abstract: When multi-armed bandit (MAB) algorithms allocate pulls among competing arms, the resulting allocation can exhibit huge variation. This is particularly harmful in modern applications such as learning-enhanced platform operations and post-bandit statistical inference. Thus motivated, we introduce a new performance metric of MAB algorithms termed allocation variability, which is the largest (over arms) standard deviation of an arm’s number of pulls. We establish a fundamental trade-off between allocation variability and regret, the canonical […]
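To make the metric in the abstract concrete, here is a minimal sketch that estimates allocation variability (the largest, over arms, standard deviation of an arm's pull count) by simulation. The choice of UCB1, Bernoulli rewards, and the function names `run_ucb1` and `allocation_variability` are assumptions made for illustration; none of them come from the paper.

```python
import math
import random
import statistics


def run_ucb1(means, horizon, rng):
    """Run UCB1 on Bernoulli arms and return the number of pulls per arm."""
    k = len(means)
    pulls = [0] * k
    totals = [0.0] * k
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1  # pull each arm once to initialise the estimates
        else:
            # Empirical mean plus the standard UCB1 exploration bonus.
            arm = max(
                range(k),
                key=lambda a: totals[a] / pulls[a]
                + math.sqrt(2.0 * math.log(t) / pulls[a]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        pulls[arm] += 1
        totals[arm] += reward
    return pulls


def allocation_variability(means, horizon, runs, seed=0):
    """Estimate the max over arms of the std. dev. of pull counts across runs."""
    rng = random.Random(seed)
    counts = [run_ucb1(means, horizon, rng) for _ in range(runs)]
    per_arm_std = [
        statistics.pstdev([run[a] for run in counts]) for a in range(len(means))
    ]
    return max(per_arm_std)


if __name__ == "__main__":
    # Hypothetical instance: two arms with equal means, 500 rounds, 200 runs.
    print(allocation_variability([0.5, 0.5], horizon=500, runs=200))
```

With only two arms the pull counts sum to the horizon, so both arms have the same standard deviation; with more arms the maximum picks out the least stable one.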

technocracy

Nonconvex Latent Optimally Partitioned Block-Sparse Recovery via Log-Sum and Minimax Concave Penalties

digitado ⋅ March 3, 2026

arXiv:2603.01304v1 Announce Type: cross Abstract: We propose two nonconvex regularization methods, LogLOP-l2/l1 and AdaLOP-l2/l1, for recovering block-sparse signals with unknown block partitions. These methods address the underestimation bias of existing convex approaches by extending log-sum penalty and the Minimax Concave Penalty (MCP) to the block-sparse domain via novel variational formulations. Unlike Generalized Moreau Enhancement (GME) and Bayesian methods dependent on the squared-error data fidelity term, our proposed methods are compatible with a broad range of data fidelity terms. […]

technocracy

De-ICE Disco at the Googleplex

digitado ⋅ January 31, 2026

When Renee Good and Alex Pretti were murdered, and I saw the incredible courage of people in Minneapolis in the face of state brutality, I had to find some way to show that tech workers stand with Minnesota, even if our leaders don’t. I signed the ICEout petition, and I’d encourage you to do the same. I’ve also been talking to the press about why I signed it, and on the Wired Uncanny Valley podcast Kate Drummond asked […]

technocracy

Smart Diagnosis and Early Intervention in PCOS: A Deep Learning Approach to Women’s Reproductive Health

digitado ⋅ February 4, 2026

Polycystic Ovary Syndrome (PCOS) is a widespread disorder in women of reproductive age, characterized by a hormonal imbalance, irregular periods, and multiple ovarian cysts. Infertility, metabolic syndrome, and cardiovascular risks are long-term complications that make early detection essential. In this paper, we design a powerful framework based on transfer learning utilizing DenseNet201 and ResNet50 for classifying ovarian ultrasound images. The model was trained on an online dataset containing 3856 ultrasound images of cyst-infected and non-infected patients. Each ultrasound […]

technocracy

Which RL-Library for variable Environment-Spaces?

digitado ⋅ January 13, 2026

Hello guys, which library would be best for training an RL agent on different environment spaces? I am working on a scheduler that assigns tasks to machines. Some datasets contain, for example, 10 machines and 50 operations, and others 5 machines and 20 operations, so my Gym environment changes depending on the dataset. I get the error below when I'm using SB3. My question is: are there libraries that can deal with this? ValueError Traceback (most […]
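One common workaround, not something the post itself proposes, is to keep the spaces fixed by padding every instance to a maximum size and passing a mask that marks the real rows. The sketch below shows only the padding step in plain Python; `pad_rows`, the feature layout, and the maximum sizes are hypothetical names and values chosen for illustration.

```python
def pad_rows(rows, max_rows, width):
    """Pad a list of feature rows to a fixed number of rows.

    Returns (padded, mask), where mask[i] is 1 for real rows and 0 for padding.
    """
    if len(rows) > max_rows:
        raise ValueError("instance exceeds the fixed maximum size")
    if any(len(row) != width for row in rows):
        raise ValueError("every row must have exactly `width` features")
    n_pad = max_rows - len(rows)
    padded = [list(row) for row in rows] + [[0.0] * width for _ in range(n_pad)]
    mask = [1] * len(rows) + [0] * n_pad
    return padded, mask


if __name__ == "__main__":
    # Hypothetical instance with 2 machines, padded to a fixed maximum of 4.
    machines = [[1.0, 0.5], [0.0, 2.0]]
    padded, mask = pad_rows(machines, max_rows=4, width=2)
    print(padded)  # [[1.0, 0.5], [0.0, 2.0], [0.0, 0.0], [0.0, 0.0]]
    print(mask)    # [1, 1, 0, 0]
```

The padded rows and the mask would then form a fixed-shape observation, so the same policy network can be reused across datasets as long as none exceeds the chosen maximum.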

technocracy

StagePilot: A Deep Reinforcement Learning Agent for Stage-Controlled Cybergrooming Simulation

digitado ⋅ February 4, 2026

Cybergrooming is an evolving threat to youth, necessitating proactive educational interventions. We propose StagePilot, an offline RL-based dialogue agent that simulates the stage-wise progression of grooming behaviors for prevention training. StagePilot selects conversational stages using a composite reward that balances user sentiment and goal proximity, with transitions constrained to adjacent stages for realism and interpretability. We evaluate StagePilot through LLM-based simulations, measuring stage completion, dialogue efficiency, and emotional engagement. Results show that StagePilot generates realistic and coherent conversations […]

technocracy

Diffusion-based Generative Machine Learning Model for Predicting Crack Propagation in Aluminum Nitride at the Atomic Scale

digitado ⋅ March 13, 2026

Predicting atomic-scale crack propagation in aluminum nitride (AlN) is critical for semiconductor reliability but remains prohibitively expensive via molecular dynamics (MD). We develop a diffusion-based generative machine learning model to predict atomic-scale crack propagation in AlN, a critical semiconductor material, by conditioning solely on initial microstructure embeddings. Trained on MD simulations of single-crack systems, the model achieves a significant speedup while accurately forecasting dynamic fracture processes, including stress-driven crack initiation, crack branching, and atomic-scale bridging ligaments. Crucially, it […]

technocracy

3DSPA: A 3D Semantic Point Autoencoder for Evaluating Video Realism

digitado ⋅ February 25, 2026

arXiv:2602.20354v1 Announce Type: new Abstract: AI video generation is evolving rapidly. For video generators to be useful for applications ranging from robotics to film-making, they must consistently produce realistic videos. However, evaluating the realism of generated videos remains a largely manual process — requiring human annotation or bespoke evaluation datasets which have restricted scope. Here we develop an automated evaluation framework for video realism which captures both semantics and coherent 3D structure and which does not require access […]

technocracy

Yann LeCun’s $1B bet against LLMs

digitado ⋅ March 11, 2026

Good morning, AI enthusiasts. Few people in AI have been louder about LLMs being a dead end than Yann LeCun. Even fewer have a Turing Award and a billion dollars to do something about it. His new Advanced Machine Intelligence just launched with over $1B in funding to build what he believes LLMs never can: AI that actually understands the real world. In today’s AI rundown: LeCun’s […]

technocracy

CLASP: An online learning algorithm for Convex Losses And Squared Penalties

digitado ⋅ January 22, 2026

We study Constrained Online Convex Optimization (COCO), where a learner chooses actions iteratively, observes both unanticipated convex loss and convex constraint, and accumulates loss while incurring penalties for constraint violations. We introduce CLASP (Convex Losses And Squared Penalties), an algorithm that minimizes cumulative loss together with squared constraint violations. Our analysis departs from prior work by fully leveraging the firm non-expansiveness of convex projectors, a proof strategy not previously applied in this setting. For convex losses, CLASP achieves […]
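As a rough guide to the two quantities the excerpt mentions, they could be written as follows. The symbols are assumptions introduced here, not the paper's notation: $x_t$ is the action at round $t$, $f_t$ the convex loss, and $g_t$ the convex constraint, with $g_t(x) \le 0$ meaning feasible. The paper's exact definition of violation may differ.

```latex
\[
  \mathrm{Loss}_T = \sum_{t=1}^{T} f_t(x_t),
  \qquad
  \mathrm{Violation}_T = \sum_{t=1}^{T} \bigl( [\, g_t(x_t) \,]_+ \bigr)^{2},
  \qquad
  [z]_+ = \max\{z, 0\}.
\]
```

Squaring the positive part penalizes large violations more heavily than small ones, which is one plausible reading of "squared penalties" in the algorithm's name.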
