January 2026

TSSR: Two-Stage Swap-Reward-Driven Reinforcement Learning for Character-Level SMILES Generation

digitado ⋅ 8 de January de 2026

The design of reliable, valid, and diverse molecules is fundamental to modern drug discovery, as improved molecular generation supports efficient exploration of the chemical space for potential drug candidates and reduces the cost of early design efforts. Despite these needs, current chemical language models that generate molecules as SMILES strings are vulnerable to compounding token errors: many samples are unparseable or chemically implausible, and hard constraints meant to prevent failure can restrict exploration. To address this gap, we […]

Ver mais

Like 0

Liked Liked

technocracy

Integrating Distribution Matching into Semi-Supervised Contrastive Learning for Labeled and Unlabeled Data

digitado ⋅ 8 de January de 2026

The advancement of deep learning has greatly improved supervised image classification. However, labeling data is costly, prompting research into unsupervised learning methods such as contrastive learning. In real-world scenarios, fully unlabeled datasets are rare, making semi-supervised learning (SSL) highly relevant in scenarios where a small amount of labeled data coexists with a large volume of unlabeled data. A well-known semi-supervised contrastive learning approach involves assigning pseudo-labels to unlabeled data. This study aims to enhance pseudo-label-based SSL by incorporating […]

Ver mais

Like 0

Liked Liked

technocracy

Multiagent Reinforcement Learning with Neighbor Action Estimation

digitado ⋅ 8 de January de 2026

Multiagent reinforcement learning, as a prominent intelligent paradigm, enables collaborative decision-making within complex systems. However, existing approaches often rely on explicit action exchange between agents to evaluate action value functions, which is frequently impractical in real-world engineering environments due to communication constraints, latency, energy consumption, and reliability requirements. From an artificial intelligence perspective, this paper proposes an enhanced multiagent reinforcement learning framework that employs action estimation neural networks to infer agent behaviors. By integrating a lightweight action estimation […]

Ver mais

Like 0

Liked Liked

technocracy

NASA considers evacuating ailing crew member from International Space Station

digitado ⋅ 8 de January de 2026

Someone on the International Space Station suffered an unspecified “medical situation” Wednesday, prompting the postponement of a planned spacewalk and raising the possibility of an early return for a portion of the lab’s seven-person crew, NASA said in a statement. NASA has never ordered a medical evacuation from space before, but the option has always been available at the International Space Station with lifeboats ready for activation. The space agency announced the spacewalk postponement Wednesday afternoon due to […]

Ver mais

Like 0

Liked Liked

technocracy

Hybrid Federated Learning for Noise-Robust Training

digitado ⋅ 8 de January de 2026

Federated learning (FL) and federated distillation (FD) are distributed learning paradigms that train UE models with enhanced privacy, each offering different trade-offs between noise robustness and learning speed. To mitigate their respective weaknesses, we propose a hybrid federated learning (HFL) framework in which each user equipment (UE) transmits either gradients or logits, and the base station (BS) selects the per-round weights of FL and FD updates. We derive convergence of HFL framework and introduce two methods to exploit […]

Ver mais

Like 0

Liked Liked

technocracy

Prediction of Cellular Malignancy Using Electrical Impedance Signatures and Supervised Machine Learning

digitado ⋅ 8 de January de 2026

Bioelectrical properties of cells such as relative permittivity, conductivity, and characteristic time constants vary significantly between healthy and malignant cells across different frequencies. These distinctions provide a promising foundation for diagnostic and classification applications. This study systematically reviewed 33 scholarly articles to compile datasets of quantitative bioelectric parameters and evaluated their utility in predictive modeling. Three supervised machine learning algorithms- Random Forest (RF), Support Vector Machine (SVM), and K-Nearest Neighbor (KNN) were implemented and tuned using key hyperparameters […]

Ver mais

Like 0

Liked Liked

technocracy

I built an open-source 3D soccer game for Reinforcement Learning experiments

digitado ⋅ 8 de January de 2026

https://preview.redd.it/2wxhkzftz0cg1.png?width=2558&format=png&auto=webp&s=8b0be30b0534dde5687b9f958eef97d25f015377 I wanted to get into reinforcement learning but couldn’t find a game environment that clicked with me. Inspired by AI Warehouse videos, I decided to build my own. Cube Soccer 3D is a minimalist soccer game where cube players with googly eyes compete to score goals. It’s designed specifically as an RL training environment. Tech stack: – Rust + Bevy (game engine) – Rapier3D (physics) – Modular architecture for easy RL integration – Gymnasium-compatible Python bindings Features: […]

Ver mais

Like 0

Liked Liked

technocracy

Concept Tokens: Learning Behavioral Embeddings Through Concept Definitions

digitado ⋅ 8 de January de 2026

We propose Concept Tokens, a lightweight method that adds a new special token to a pretrained LLM and learns only its embedding from multiple natural language definitions of a target concept, where occurrences of the concept are replaced by the new token. The LLM is kept frozen and the embedding is optimized with the standard language-modeling objective. We evaluate Concept Tokens in three settings. First, we study hallucinations in closed-book question answering on HotpotQA and find a directional […]

Ver mais

Like 0

Liked Liked

technocracy

Using Large Language Models to Detect Socially Shared Regulation of Collaborative Learning

digitado ⋅ 8 de January de 2026

The field of learning analytics has made notable strides in automating the detection of complex learning processes in multimodal data. However, most advancements have focused on individualized problem-solving instead of collaborative, open-ended problem-solving, which may offer both affordances (richer data) and challenges (low cohesion) to behavioral prediction. Here, we extend predictive models to automatically detect socially shared regulation of learning (SSRL) behaviors in collaborative computational modeling environments using embedding-based approaches. We leverage large language models (LLMs) as summarization […]

Ver mais

Like 0

Liked Liked

technocracy

Ford is getting ready to put AI assistants in its cars

digitado ⋅ 8 de January de 2026

The annual Consumer Electronics Show is currently raging in Las Vegas, and as has become traditional over the past decade, automakers and their suppliers now use the conference to announce their technology plans. Tonight it was Ford’s turn, and it is very on-trend for 2026. If you guessed that means AI is coming to the Ford in-car experience, congratulations, you guessed right. Even though the company owes everything to mass-producing identical vehicles, it says that it wants AI […]

Ver mais

Like 0

Liked Liked