digitado

Are Aligned Large Language Models Still Misaligned?

digitado ⋅ 13 de February de 2026

arXiv:2602.11305v1 Announce Type: new Abstract: Misalignment in Large Language Models (LLMs) arises when model behavior diverges from human expectations and fails to simultaneously satisfy safety, value, and cultural dimensions, which must co-occur in real-world settings to solve a real-world query. Existing misalignment benchmarks-such as INSECURE CODE (safety-centric), VALUEACTIONLENS (value-centric), and CULTURALHERITAGE (culture centric)-rely on evaluating misalignment along individual dimensions, preventing simultaneous evaluation. To address this gap, we introduce Mis-Align Bench, a unified benchmark for analyzing misalignment across safety, […]

Ver mais

Like 0

Liked Liked

technocracy

Drug Repurposing With Graph Neural Networks

digitado ⋅ 5 de October de 2023

In modern healthcare and medicine, the quest for novel treatments and therapies is a perpetual challenge. Groundbreaking solutions can be discovered not only through innovation but also by uncovering hidden relationships within existing data. This is what Graph Neural Networks (GNNs) do. This cutting-edge fusion of graph theory and deep learning can transform drug repurposing—the process of finding new medical uses for already existing drugs. Imagine if we could reposition existing drugs to combat diseases they were never […]

Ver mais

Like 0

Liked Liked

technocracy

Flow-Enabled Generalization to Human Demonstrations in Few-Shot Imitation Learning

digitado ⋅ 11 de February de 2026

Imitation Learning (IL) enables robots to learn complex skills from demonstrations without explicit task modeling, but it typically requires large amounts of demonstrations, creating significant collection costs. Prior work has investigated using flow as an intermediate representation to enable the use of human videos as a substitute, thereby reducing the amount of required robot demonstrations. However, most prior work has focused on the flow, either on the object or on specific points of the robot/hand, which cannot describe […]

Ver mais

Like 0

Liked Liked

technocracy

Adaptive Active Learning for Online Reliability Prediction of Satellite Electronics

digitado ⋅ 10 de March de 2026

Accurate on-orbit reliability prediction for satellite electronics is often hindered by limited data availability, varying operational conditions, and considerable unit-to-unit variability. To overcome these obstacles, this paper proposes a novel integrated online reliability prediction framework. The main contributions are twofold. First, a Wiener process-based degradation model is developed, incorporating a generalized Arrhenius link function, individual random effects, and spatial correlations among adjacent units. A customized maximum likelihood estimation method is further devised to facilitate efficient and accurate parameter […]

Ver mais

Like 0

Liked Liked

technocracy

Japan lost a 5-ton navigation satellite when it fell off a rocket during launch

digitado ⋅ 28 de January de 2026

If you’re in the space business long enough, you learn there are numerous ways a rocket can fail. I’ve written my share of stories about misbehaving rockets and the extensive investigations that usually—but not always—reveal what went wrong. But I never expected to write this story. Maybe this was a failure of my own imagination. I’m used to writing about engine malfunctions, staging issues, guidance glitches, or structural failures. Last April, Ars reported on the bizarre failure of […]

Ver mais

Like 0

Liked Liked

technocracy

Demystifying AI agents

digitado ⋅ 16 de October de 2025

Demystifying AI agents Amazon vice president and distinguished engineer Marc Brooker explains how agentic systems work under the hood and how AWSs new AgentCore framework implements their essential components. Cloud and systems Marc Brooker October 16, 02:04 PM October 22, 11:02 AM Agents are the trendiest topic in AI today, and with good reason. AI agents act on their users behalf, autonomously doing things like making online purchases, building software, researching business trends, or booking travel. By taking […]

Ver mais

Like 0

Liked Liked

technocracy

Fast Algorithms for Optimal Damping in Mechanical Systems

digitado ⋅ 12 de January de 2026

arXiv:2601.05404v1 Announce Type: new Abstract: Optimal damping aims at determining a vector of damping coefficients $nu$ that maximizes the decay rate of a mechanical system’s response. This problem can be formulated as the minimization of the trace of the solution of a Lyapunov equation whose coefficient matrix depends on $nu$. For physical relevance, the damping coefficients must be nonnegative and the resulting system must be asymptotically stable. We identify conditions under which the system is never stable or […]

Ver mais

Like 0

Liked Liked

technocracy

An Overview of Recent Advances in Natural Language Processing for Information Systems

digitado ⋅ 2 de January de 2026

The crux of information systems is efficient storage and access to useful data by users. This paper is an overview of work that has advanced the use of such systems in recent years, primarily in machine learning, and specifically deep learning methods. Situating progress in terms of classical pattern recognition techniques for text, we review computational methods to process spoken and written data. Digital assistants such as Siri, Cortana, and Google Now exploit large language models and encoder-only […]

Ver mais

Like 0

Liked Liked

technocracy

From Chaos to Clarity: Schema-Constrained AI for Auditable Biomedical Evidence Extraction from Full-Text PDFs

digitado ⋅ 22 de January de 2026

arXiv:2601.14267v1 Announce Type: new Abstract: Biomedical evidence synthesis relies on accurate extraction of methodological, laboratory, and outcome variables from full-text research articles, yet these variables are embedded in complex scientific PDFs that make manual abstraction time-consuming and difficult to scale. Existing document AI systems remain limited by OCR errors, long-document fragmentation, constrained throughput, and insufficient auditability for high-stakes synthesis. We present a schema-constrained AI extraction system that transforms full-text biomedical PDFs into structured, analysis-ready records by explicitly restricting […]

Ver mais

Like 0

Liked Liked

technocracy

Train my reaction time and other things.

digitado ⋅ 14 de January de 2026

If i were to zap myself everytime i got under 190ms reaction time and kept lowering the threshold and made a program do the zaping would i increase my reaction time. if so i would also like to do that with data processing so showing a certain amount of numbers on a screen for a quarter second and trying to memorize all of the numbers increasing the amount of number gradually and zapping myself for every wrong number […]

Ver mais

Like 0

Liked Liked