April 2026

Exploration Hacking: Can LLMs Learn to Resist RL Training?

digitado ⋅ April 30, 2026

Reinforcement learning (RL) has become essential to the post-training of large language models (LLMs) for reasoning, agentic capabilities and alignment. Successful RL relies on sufficient exploration of diverse actions by the model during training, which creates a potential failure mode: a model could strategically alter its exploration during training to influence the subsequent training outcome. In this paper we study this behavior, called exploration hacking. First, we create model organisms of selective RL resistance by fine-tuning LLMs to […]

technocracy

PhyCo: Learning Controllable Physical Priors for Generative Motion

digitado ⋅ April 30, 2026

Modern video diffusion models excel at appearance synthesis but still struggle with physical consistency: objects drift, collisions lack realistic rebound, and material responses seldom match their underlying properties. We present PhyCo, a framework that introduces continuous, interpretable, and physically grounded control into video generation. Our approach integrates three key components: (i) a large-scale dataset of over 100K photorealistic simulation videos where friction, restitution, deformation, and force are systematically varied across diverse scenarios; (ii) physics-supervised fine-tuning of a pretrained […]

technocracy

Mapping the Phase Diagram of the Vicsek Model with Machine Learning

digitado ⋅ April 30, 2026

In this study, we use machine learning to classify and interpolate the phase structure of the Vicsek flocking model across the three-dimensional parameter space $(η,ρ,v_0)$. We construct a dataset of simulated parameter points and characterize each point using long-time dynamical observables. These observables are then used as inputs to a K-Means clustering procedure, which assigns each point to a disorder, order, or coexistence phase. Using these clustered labels, we train a neural-network classifier to learn the mapping from […]

technocracy

AWS Generative AI Model Agility Solution: A comprehensive guide to migrating LLMs for generative AI production

digitado ⋅ April 30, 2026

Maintaining model agility is crucial for organizations to adapt to technological advancements and optimize their artificial intelligence (AI) solutions. Whether transitioning between different large language model (LLM) families or upgrading to newer versions within the same family, a structured migration approach and a standardized process are essential for facilitating continuous performance improvement while minimizing operational disruptions. However, developing such a solution is challenging in both technical and non-technical aspects because the solution needs to: Be generic to cover a […]

technocracy

Sun Finance automates ID extraction and fraud detection with generative AI on AWS

digitado ⋅ April 30, 2026

This post was co-authored with Krišjānis Kočāns, Kaspars Magaznieks, and Sergei Kiriasov from Sun Finance Group. If you process identity documents at scale (loan applications, account openings, compliance checks), you’ve likely hit the same wall: traditional optical character recognition (OCR) gets you partway there, but extraction errors still push a large share of applications into manual review queues. Add fraud detection to the mix, and the manual workload compounds. Sun Finance, a Latvian fintech founded in 2017, operates as a technology-first […]

technocracy

Unleashing Agentic AI Analytics on Amazon SageMaker with Amazon Athena and Amazon Quick

digitado ⋅ April 30, 2026

Modern enterprises face mounting challenges in extracting actionable insights from vast data lakes and lakehouses spanning petabytes of structured and unstructured data. Traditional analytics require specialized technical expertise in SQL, data modeling, and business intelligence tools, creating bottlenecks that slow decision-making across retail, financial services, healthcare, travel and hospitality, manufacturing, and many other industries. This architecture demonstrates how agentic AI assistants from Amazon Quick transform data analytics into a self-service capability. It showcases enabling business users to query […]

technocracy

Configuring Amazon Bedrock AgentCore Gateway for secure access to private resources

digitado ⋅ April 30, 2026

AI agents in production environments often need to reach internal APIs, databases, and private resources that sit behind Amazon Virtual Private Cloud (Amazon VPC) boundaries. Managing private connectivity for each agent-to-tool path adds operational overhead and slows deployment. Amazon Bedrock AgentCore VPC connectivity is designed to deploy AI agents and Model Context Protocol (MCP) servers without requiring the network traffic to be exposed to the public internet. This capability extends to managed Amazon VPC egress for Amazon Bedrock […]

technocracy

A Unified Framework of Hyperbolic Graph Representation Learning Methods

digitado ⋅ April 30, 2026

Hyperbolic geometry has emerged as an effective latent space for representing complex networks, owing to its ability to capture hierarchical organization and heterogeneous connectivity patterns using low-dimensional embeddings. As a result, numerous hyperbolic graph representation learning methods have been proposed in recent years. However, their practical adoption and systematic comparison remain challenging, as implementations are fragmented and shared tools for reproducible and fair evaluation are lacking. In this work, we introduce a unified open-source framework for hyperbolic graph […]

technocracy

What LG and NVIDIA’s talks reveal about the future of physical AI

digitado ⋅ April 30, 2026

LG is currently engaged in exploratory discussions with NVIDIA concerning physical AI, data centres, and mobility. Following a meeting in Seoul between LG CEO Ryu Jae-cheol and Madison Huang, Senior Director of Product Marketing for Omniverse and Robotics at NVIDIA, the core operational dependencies required to run complex automated systems are becoming apparent. While the companies have not formalised investment amounts or timelines, their intersecting hardware and processing priorities highlight the massive capital expenditure required to bring autonomous […]

technocracy

Early Detection of Water Stress by Plant Electrophysiology: Machine Learning for Irrigation Management

digitado ⋅ April 30, 2026

Purpose: Fast detection of plant stress is key to plant phenotyping, precision agriculture, and automated crop management. In particular, efficient irrigation management requires early identification of water stress to optimize resource use while maintaining crop performance. Direct physiological sensing offers the potential to detect stress responses before visible symptoms appear. Methods: In this study, we recorded electrophysiological signals from greenhouse-grown tomato plants subjected to water stress and developed a framework based on machine learning for online stress detection. […]
