digitado

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

digitado ⋅ 4 de March de 2026

arXiv:2509.22613v2 Announce Type: replace-cross Abstract: Recent reinforcement learning (RL) methods have substantially enhanced the planning capabilities of Large Language Models (LLMs), yet the theoretical basis for their effectiveness remains elusive. In this work, we investigate RL’s benefits and limitations through a tractable graph-based abstraction, focusing on policy gradient (PG) and Q-learning methods. Our theoretical analyses reveal that supervised fine-tuning (SFT) may introduce co-occurrence-based spurious solutions, whereas RL achieves correct planning primarily through exploration, underscoring exploration’s role in enabling […]

Ver mais

Like 0

Liked Liked

technocracy

Impact of Clustering on the Observability and Controllability of Complex Networks

digitado ⋅ 6 de January de 2026

arXiv:2601.00221v1 Announce Type: new Abstract: The increasing complexity and interconnectedness of systems across various fields have led to a growing interest in studying complex networks, particularly Scale-Free (SF) networks, which best model real-world systems. This paper investigates the influence of clustering on the observability and controllability of complex SF networks, framing these characteristics in the context of structured systems theory. In this paper, we show that densely clustered networks require fewer driver and observer nodes due to better […]

Ver mais

Like 0

Liked Liked

technocracy

Meta’s AI chief scientist leaves with parting shots

digitado ⋅ 5 de January de 2026

Read Online | Sign Up | Advertise Good morning, {{ first_name | AI enthusiasts }}. Acclaimed AI chief scientist Yann LeCun just departed Meta after over a decade, and the outspoken researcher definitely didn’t leave quietly. From calling his boss Alexandr Wang “inexperienced” to admitting Llama 4 benchmarks were “fudged,” LeCun’s parting shots in a candid interview encapsulate the tension between Meta’s old guard and new AI direction. In today’s AI rundown: LeCun blasts Meta’s AI leadership on […]

Ver mais

Like 0

Liked Liked

technocracy

Distributed Semi-Speculative Parallel Anisotropic Mesh Adaptation

digitado ⋅ 18 de February de 2026

arXiv:2602.15204v1 Announce Type: new Abstract: This paper presents a distributed memory method for anisotropic mesh adaptation that is designed to avoid the use of collective communication and global synchronization techniques. In the presented method, meshing functionality is separated from performance aspects by utilizing a separate entity for each – a multicore cc-NUMA-based (shared memory) mesh generation software and a parallel runtime system that is designed to help applications leverage the concurrency offered by emerging high-performance computing (HPC) architectures. […]

Ver mais

Like 0

Liked Liked

technocracy

Comparative Analysis of Neural Retriever-Reranker Pipelines for Retrieval-Augmented Generation over Knowledge Graphs in E-commerce Applications

digitado ⋅ 27 de February de 2026

arXiv:2602.22219v1 Announce Type: new Abstract: Recent advancements in Large Language Models (LLMs) have transformed Natural Language Processing (NLP), enabling complex information retrieval and generation tasks. Retrieval-Augmented Generation (RAG) has emerged as a key innovation, enhancing factual accuracy and contextual grounding by integrating external knowledge sources with generative models. Although RAG demonstrates strong performance on unstructured text, its application to structured knowledge graphs presents challenges: scaling retrieval across connected graphs and preserving contextual relationships during response generation. Cross-encoders refine […]

Ver mais

Like 0

Liked Liked

technocracy

Is Society Just a Really Complicated Brain?

digitado ⋅ 10 de February de 2026

What makes a brain? Everyone has an answer, but almost no one has a precise one. A friend or a stranger on the street may be able to tell you that the brain makes us feel pain, bliss, or everything in between. Some might be able to tell you that it helps us regulate balance, and some might say that it bridges our senses and the “subconscious.” To me, the brain is intriguing precisely because it’s a bodily […]

Ver mais

Like 0

Liked Liked

technocracy

Learning dynamics from online-offline systems of LLM agents

digitado ⋅ 2 de March de 2026

arXiv:2602.23437v1 Announce Type: new Abstract: Online information is increasingly linked to real-world instability, especially as automated accounts and LLM-based agents help spread and amplify news. In this work, we study how information spreads on networks of Large Language Models (LLMs) using mathematical models. We investigate how different types of offline events, along with the “personalities” assigned to the LLMs, affect the network dynamics of online information spread of the events among the LLMs. We introduce two models: 1) […]

Ver mais

Like 0

Liked Liked

technocracy

Nonparametric Tree Graphical Models

digitado ⋅ 31 de March de 2010

We introduce a nonparametric representation for graphical model on trees which expresses marginals as Hilbert space embeddings and conditionals as embedding operators. This formulation allows us to define a graphical model solely on the basis of the feature space representation of its variables. Thus, this nonparametric model can be applied to general domains where kernels are defined, handling challenging cases such as discrete variables whose domains are huge, or very complex, non-Gaussian continuous distributions. We also derive kernel […]

Ver mais

Like 0

Liked Liked

technocracy

Synthetic Interaction Data for Scalable Personalization in Large Language Models

digitado ⋅ 16 de February de 2026

arXiv:2602.12394v1 Announce Type: new Abstract: Personalized prompting offers large opportunities for deploying large language models (LLMs) to diverse users, yet existing prompt optimization methods primarily focus on task-level optimization while largely overlooking user-specific preferences and latent constraints of individual users. This gap is primarily due to (i) the absence of high-quality, privacy-sensitive data that capture personalized user-LLM interactions at scale, and (ii) the lack of robust reward signals for individual preferences. To overcome existing data limitations, we introduce […]

Ver mais

Like 0

Liked Liked

technocracy

Comparison of Credential Management Systems Based on the Standards of IEEE, ETSI, and YD/T 3957-2021

digitado ⋅ 5 de March de 2026

arXiv:2603.03376v1 Announce Type: new Abstract: As V2X (Vehicle-to-Everything) technology becomes increasingly prevalent, the security of V2X networks has garnered growing attention worldwide. In North America, the IEEE 1609 series standards are primarily used, while Europe adopts the ETSI series standards, and China has also established its industry standard, YD/T 3957-2021, among others. Although these standards share some commonalities, they also exhibit differences. To achieve compatibility across these standards, analyzing their similarities and differences is a crucial issue. Therefore, […]

Ver mais

Like 0

Liked Liked