digitado

Combining Adam and its Inverse Counterpart to Enhance Generalization of Deep Learning Optimizers

digitado ⋅ 7 de March de 2026

In the training of neural networks, adaptive moment estimation (Adam) typically converges fast but exhibits suboptimal generalization performance. A widely accepted explanation for its defect in generalization is that it often tends to converge to sharp minima. To enhance its ability to find flat minima, we propose its new variant named inverse Adam (InvAdam). The key improvement of InvAdam lies in its parameter update mechanism, which is opposite to that of Adam. Specifically, it computes element-wise multiplication of […]

Ver mais

Like 0

Liked Liked

technocracy

Knuth prize call for nominations

digitado ⋅ 3 de March de 2025

Please consider nominating worthy candidates to the Knuth Prize! The prize is awarded “for major research accomplishments and contributions to the foundations of computer science over an extended period of time.” 2025 Knuth Prize Committee:Noga Alon, Edith Cohen (Chair), David Eppstein, Valerie King, Salil Vadhan, and Moshe Vardi. Deadline: March 31 https://www.sigact.org/prizes/knuth.html

Ver mais

Like 0

Liked Liked

technocracy

Deepening our collaboration with the U.S. Department of Energy

digitado ⋅ 18 de December de 2025

OpenAI and the U.S. Department of Energy have signed a memorandum of understanding to deepen collaboration on AI and advanced computing in support of scientific discovery. The agreement builds on ongoing work with national laboratories and helps establish a framework for applying AI to high-impact research across the DOE ecosystem.

Ver mais

Like 0

Liked Liked

technocracy

Flow Matching for Offline Reinforcement Learning with Discrete Actions

digitado ⋅ 9 de February de 2026

arXiv:2602.06138v1 Announce Type: new Abstract: Generative policies based on diffusion models and flow matching have shown strong promise for offline reinforcement learning (RL), but their applicability remains largely confined to continuous action spaces. To address a broader range of offline RL settings, we extend flow matching to a general framework that supports discrete action spaces with multiple objectives. Specifically, we replace continuous flows with continuous-time Markov chains, trained using a Q-weighted flow matching objective. We then extend our […]

Ver mais

Like 0

Liked Liked

technocracy

EMPA: Evaluating Persona-Aligned Empathy as a Process

digitado ⋅ 3 de March de 2026

arXiv:2603.00552v1 Announce Type: new Abstract: Evaluating persona-aligned empathy in LLM-based dialogue agents remains challenging. User states are latent, feedback is sparse and difficult to verify in situ, and seemingly supportive turns can still accumulate into trajectories that drift from persona-specific needs. We introduce EMPA, a process-oriented framework that evaluates persona-aligned support as sustained intervention rather than isolated replies. EMPA distills real interactions into controllable, psychologically grounded scenarios, couples them with an open-ended multi-agent sandbox that exposes strategic adaptation […]

Ver mais

Like 0

Liked Liked

technocracy

Importance inversion transfer identifies shared principles for cross-domain learning

digitado ⋅ 11 de February de 2026

arXiv:2602.09116v1 Announce Type: new Abstract: The capacity to transfer knowledge across scientific domains relies on shared organizational principles. However, existing transfer-learning methodologies often fail to bridge radically heterogeneous systems, particularly under severe data scarcity or stochastic noise. This study formalizes Explainable Cross-Domain Transfer Learning (X-CDTL), a framework unifying network science and explainable artificial intelligence to identify structural invariants that generalize across biological, linguistic, molecular, and social networks. By introducing the Importance Inversion Transfer (IIT) mechanism, the framework prioritizes […]

Ver mais

Like 0

Liked Liked

technocracy

Auditing Sybil: Explaining Deep Lung Cancer Risk Prediction Through Generative Interventional Attributions

digitado ⋅ 4 de February de 2026

arXiv:2602.02560v1 Announce Type: new Abstract: Lung cancer remains the leading cause of cancer mortality, driving the development of automated screening tools to alleviate radiologist workload. Standing at the frontier of this effort is Sybil, a deep learning model capable of predicting future risk solely from computed tomography (CT) with high precision. However, despite extensive clinical validation, current assessments rely purely on observational metrics. This correlation-based approach overlooks the model’s actual reasoning mechanism, necessitating a shift to causal verification […]

Ver mais

Like 0

Liked Liked

technocracy

High RAM prices mean record-setting profits for Samsung and other memory makers

digitado ⋅ 8 de January de 2026

Supply shortages and big price increases for RAM and storage have been a major drag for enthusiasts and PC builders in recent months. And while we haven’t yet seen large, widespread price increases for memory-dependent products like pre-built laptop PCs, smartphones, and graphics cards, most companies expect that to change this year if shortages continue. In the meantime, memory manufacturers are riding high demand and high prices to record profits. In revenue guidance released this week, Samsung Electronics […]

Ver mais

Like 0

Liked Liked

technocracy

What’s next after the Trump administration revokes key finding on climate change?

digitado ⋅ 11 de February de 2026

Following three of the warmest years on record, as scientists reckon with climate tipping points and states and cities grapple with the escalating cost of extreme weather and more intense wildfires, the Trump administration this week is expected to formally eliminate the US government’s role in controlling greenhouse gas pollution. By revoking its 17-year-old scientific finding that greenhouse gases endanger public health and welfare, the Environmental Protection Agency will demolish the legal underpinning of its authority to act […]

Ver mais

Like 0

Liked Liked

technocracy

System-Level Performance Modeling of Photonic In-Memory Computing

digitado ⋅ 3 de February de 2026

arXiv:2602.00892v1 Announce Type: new Abstract: Photonic in-memory computing is a high-speed, low-energy alternative to traditional transistor-based digital computing that utilizes high photonic operating frequencies and bandwidths. In this work, we develop a comprehensive system-level performance model for photonic in-memory computing, capturing the effects of key latency sources such as external memory access and opto-electronic conversion. We perform algorithm-to-hardware mapping across a range of workloads, including the Sod shock tube problem, Matricized Tensor Times Khatri-Rao Product (MTTKRP), and the […]

Ver mais

Like 0

Liked Liked