technocracy

Faster Gradient Methods for Highly-Smooth Stochastic Bilevel Optimization

digitado ⋅ 10 de March de 2026

arXiv:2509.02937v2 Announce Type: replace-cross Abstract: This paper studies the complexity of finding an $epsilon$-stationary point for stochastic bilevel optimization when the upper-level problem is nonconvex and the lower-level problem is strongly convex. Recent work proposed the first-order method, F${}^2$SA, achieving the $tilde{mathcal{O}}(epsilon^{-6})$ upper complexity bound for first-order smooth problems. This is slower than the optimal $Omega(epsilon^{-4})$ complexity lower bound in its single-level counterpart. In this work, we show that faster rates are achievable for higher-order smooth problems. We […]

Ver mais

Like 0

Liked Liked

technocracy

Communication-Efficient Multi-Modal Edge Inference via Uncertainty-Aware Distributed Learning

digitado ⋅ 21 de January de 2026

Semantic communication is emerging as a key enabler for distributed edge intelligence due to its capability to convey task-relevant meaning. However, achieving communication-efficient training and robust inference over wireless links remains challenging. This challenge is further exacerbated for multi-modal edge inference (MMEI) by two factors: 1) prohibitive communication overhead for distributed learning over bandwidth-limited wireless links, due to the emph{multi-modal} nature of the system; and 2) limited robustness under varying channels and noisy multi-modal inputs. In this paper, […]

Ver mais

Like 0

Liked Liked

technocracy

ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation

digitado ⋅ 7 de January de 2026

arXiv:2601.02535v1 Announce Type: new Abstract: Selecting a single high-quality output from multiple stochastic generations remains a fundamental challenge for large language models (LLMs), particularly in open-ended tasks where no canonical answer exists. While Best-of-N and self-consistency methods show that aggregating multiple generations can improve performance, existing approaches typically rely on external evaluators, reward models, or exact string-match voting, limiting their applicability and efficiency. We propose Mode Extraction (ModeX), an evaluator-free Best-of-N selection framework that generalizes majority voting to […]

Ver mais

Like 0

Liked Liked

technocracy

Multi-Objective Reinforcement Learning for Generating Covalent Inhibitor Candidates

digitado ⋅ 21 de April de 2026

Rational design of covalent inhibitors requires simultaneously optimizing multiple properties, such as binding affinity, target selectivity, or electrophilic reactivity. This presents a multi-objective problem not easily addressed by screening alone. Here we present a machine learning pipeline for generating covalent inhibitor candidates using multi-objective reinforcement learning (RL), applied to two targets: epidermal growth factor receptor (EGFR) and acetylcholinesterase (ACHE). A SMILES-based pretrained LSTM serves as the generative model, optimized via policy gradient RL with Pareto crowding distance to […]

Ver mais

Like 0

Liked Liked

technocracy

Provably Extracting the Features from a General Superposition

digitado ⋅ 1 de April de 2026

arXiv:2512.15987v2 Announce Type: replace-cross Abstract: It is widely believed that complex machine learning models generally encode features through linear representations. This is the foundational hypothesis behind a vast body of work on interpretability. A key challenge toward extracting interpretable features, however, is that they exist in superposition. In this work, we study the question of extracting features in superposition from a learning theoretic perspective. We start with the following fundamental setting: we are given query access to a […]

Ver mais

Like 0

Liked Liked

technocracy

Revisiting Software Engineering Education in the Era of Large Language Models: A Curriculum Adaptation and Academic Integrity Framework

digitado ⋅ 15 de January de 2026

arXiv:2601.08857v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs), such as ChatGPT and GitHub Copilot, into professional workflows is increasingly reshaping software engineering practices. These tools have lowered the cost of code generation, explanation, and testing, while introducing new forms of automation into routine development tasks. In contrast, most of the software engineering and computer engineering curricula remain closely aligned with pedagogical models that equate manual syntax production with technical competence. This growing misalignment raises […]

Ver mais

Like 0

Liked Liked

technocracy

Deployment-Oriented Session-wise Meta-Calibration for Landmark-Based Webcam Gaze Tracking

digitado ⋅ 16 de March de 2026

arXiv:2603.12388v1 Announce Type: new Abstract: Practical webcam gaze tracking is constrained not only by error, but also by calibration burden, robustness to head motion and session drift, runtime footprint, and browser use. We therefore target a deployment-oriented operating point rather than the image large-backbone regime. We cast landmark-based point-of-regard estimation as session-wise adaptation: a shared geometric encoder produces embeddings that can be aligned to a new session from a small calibration set. We present Equivariant Meta-Calibrated Gaze (EMC-Gaze), […]

Ver mais

Like 0

Liked Liked

technocracy

BanditLP: Large-Scale Stochastic Optimization for Personalized Recommendations

digitado ⋅ 23 de January de 2026

arXiv:2601.15552v1 Announce Type: cross Abstract: We present BanditLP, a scalable multi-stakeholder contextual bandit framework that unifies neural Thompson Sampling for learning objective-specific outcomes with a large-scale linear program for constrained action selection at serving time. The methodology is application-agnostic, compatible with arbitrary neural architectures, and deployable at web scale, with an LP solver capable of handling billions of variables. Experiments on public benchmarks and synthetic data show consistent gains over strong baselines. We apply this approach in LinkedIn’s […]

Ver mais

Like 0

Liked Liked

technocracy

Long-Term Probabilistic Forecast of Vegetation Conditions Using Climate Attributes in the Four Corners Region

digitado ⋅ 26 de January de 2026

arXiv:2601.16347v1 Announce Type: cross Abstract: Weather conditions can drastically alter the state of crops and rangelands, and in turn, impact the incomes and food security of individuals worldwide. Satellite-based remote sensing offers an effective way to monitor vegetation and climate variables on regional and global scales. The annual peak Normalized Difference Vegetation Index (NDVI), derived from satellite observations, is closely associated with crop development, rangeland biomass, and vegetation growth. Although various machine learning methods have been developed to […]

Ver mais

Like 0

Liked Liked

technocracy

The TCF doesn’t really A(A)ID — Automatic Privacy Analysis and Legal Compliance of TCF-based Android Applications

digitado ⋅ 25 de February de 2026

arXiv:2602.20222v1 Announce Type: new Abstract: The Transparency and Consent Framework (TCF), developed by the Interactive Advertising Bureau (IAB) Europe, provides a de facto standard for requesting, recording, and managing user consent from European end-users. This framework has previously been found to infringe European data protection law and has subsequently been regularly updated. Previous research on the TCF focused exclusively on web contexts, with no attention given to its implementation in mobile applications. No work has systematically studied the […]

Ver mais

Like 0

Liked Liked