digitado

Homogenized Transformers

digitado ⋅ April 3, 2026

arXiv:2604.01978v1 Announce Type: cross Abstract: We study a random model of deep multi-head self-attention in which the weights are resampled independently across layers and heads, as at initialization of training. Viewing depth as a time variable, the residual stream defines a discrete-time interacting particle system on the unit sphere. We prove that, under suitable joint scalings of the depth, the residual step size, and the number of heads, this dynamics admits a nontrivial homogenized limit. Depending on the […]


technocracy

Playing With AI: How Do State-Of-The-Art Large Language Models Perform in the 1977 Text-Based Adventure Game Zork?

digitado ⋅ February 19, 2026

arXiv:2602.15867v1 Announce Type: new Abstract: In this positioning paper, we evaluate the problem-solving and reasoning capabilities of contemporary Large Language Models (LLMs) through their performance in Zork, the seminal text-based adventure game first released in 1977. The game’s dialogue-based structure provides a controlled environment for assessing how LLM-based chatbots interpret natural language descriptions and generate appropriate action sequences to succeed in the game. We test the performance of leading proprietary models – ChatGPT, Claude, and Gemini – under […]


How to Use Propensity Score Matching to Measure Downstream Causal Impact of an Event

digitado ⋅ January 21, 2026

Suppose a social media platform’s Ads analytics team wants to know: Does seeing a certain ad (or promoted post) cause users to convert or engage more? This causal question is tricky because users who see the ad might inherently differ from those who don’t. In practice, simply comparing conversion rates of exposed vs. unexposed users can be very misleading. Ad exposure is not randomly assigned – algorithms may show ads more to highly active users, or users self-select into […]
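The selection problem described above can be illustrated with a minimal sketch of propensity score matching. Everything here is an assumption made for illustration and is not taken from the article: the data are synthetic, a single "activity" covariate drives both ad exposure and the outcome, the true effect is set to 1.0, and the helper names `fit_logistic` and `matched_effect` are hypothetical.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ts, steps=300, lr=0.5):
    """Fit P(t=1 | x) = sigmoid(a*x + b) by plain gradient descent."""
    a, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        ga = gb = 0.0
        for x, t in zip(xs, ts):
            err = sigmoid(a * x + b) - t
            ga += err * x
            gb += err
        a -= lr * ga / n
        b -= lr * gb / n
    return a, b

def matched_effect(scores, ts, ys):
    """Effect on the treated: pair each exposed unit with the unexposed
    unit whose propensity score is closest (matching with replacement)."""
    controls = [(s, y) for s, t, y in zip(scores, ts, ys) if t == 0]
    diffs = []
    for s, t, y in zip(scores, ts, ys):
        if t == 1:
            _, y_match = min(controls, key=lambda c: abs(c[0] - s))
            diffs.append(y - y_match)
    return sum(diffs) / len(diffs)

# Synthetic data (assumed): more active users are more likely to see the ad
# and also have higher outcomes regardless of exposure.
rng = random.Random(0)
n = 2000
xs = [rng.gauss(0, 1) for _ in range(n)]                   # user activity
ts = [1 if rng.random() < sigmoid(1.5 * x) else 0 for x in xs]  # ad exposure
ys = [2.0 * x + 1.0 * t + rng.gauss(0, 0.5) for x, t in zip(xs, ts)]

# Naive comparison of exposed vs. unexposed users.
treated = [y for y, t in zip(ys, ts) if t == 1]
control = [y for y, t in zip(ys, ts) if t == 0]
naive = sum(treated) / len(treated) - sum(control) / len(control)

# Propensity scores, then nearest-neighbour matching on the score.
a, b = fit_logistic(xs, ts)
scores = [sigmoid(a * x + b) for x in xs]
matched = matched_effect(scores, ts, ys)

print(f"naive difference: {naive:.2f}")
print(f"matched estimate: {matched:.2f} (true effect set to 1.0)")
```

In this toy setup the naive difference mixes the ad effect with the activity gap between the groups, while the matched estimate compares users with similar exposure probability. A real analysis would use many covariates and check overlap and balance after matching.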


Interpolation-Driven Machine Learning Approaches for Plume Shine Dose Estimation: A Comparison of XGBoost, Random Forest, and TabNet

digitado ⋅ February 23, 2026

Despite the success of machine learning (ML) in surrogate modeling, its use in radiation dose assessment is limited by safety-critical constraints, scarce training-ready data, and challenges in selecting suitable architectures for physics-dominated systems. Within this context, rapid and accurate plume shine dose estimation serves as a practical test case, as it is critical for nuclear facility safety assessment and radiological emergency response, while conventional photon-transport-based calculations remain computationally expensive. In this work, an interpolation-assisted ML framework was developed […]


Zero-Knowledge Proofs and Behavioural Analytics Mitigating Insider Threats in Contemporary Software Ecosystems

digitado ⋅ April 9, 2026

Insider threats pose a persistent and evolving challenge to contemporary software ecosystems, where privileged users can exploit access for malicious purposes, often evading traditional perimeter-based defences. This paper introduces a novel hybrid framework that synergistically integrates zero-knowledge proofs (ZKPs) and behavioural analytics to detect and mitigate such threats with enhanced privacy and precision. ZKPs enable secure authentication and data verification without revealing sensitive information, ensuring compliance with privacy regulations like GDPR while thwarting unauthorized access. Complementarily, our behavioural […]


Approximating $f$-Divergences with Rank Statistics

digitado ⋅ February 2, 2026

arXiv:2601.22784v1 Announce Type: new Abstract: We introduce a rank-statistic approximation of $f$-divergences that avoids explicit density-ratio estimation by working directly with the distribution of ranks. For a resolution parameter $K$, we map the mismatch between two univariate distributions $\mu$ and $\nu$ to a rank histogram on $\{0, \ldots, K\}$ and measure its deviation from uniformity via a discrete $f$-divergence, yielding a rank-statistic divergence estimator. We prove that the resulting estimator of the divergence is monotone in $K$, […]
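One plausible reading of the rank-histogram idea in the abstract can be sketched as follows. This is an illustrative assumption, not the paper's estimator: each sample from $\mu$ is ranked among $K$ reference points resampled from $\nu$, the ranks are binned on $\{0, \ldots, K\}$, and deviation from uniformity is measured with the KL divergence (the $f$-divergence with $f(t) = t \log t$). The function names and the Gaussian test data are hypothetical.

```python
import math
import random

def rank_histogram(xs, ys, K, rng):
    """Empirical distribution of ranks on {0, ..., K}.

    For each x in xs, draw K reference points from ys (with replacement)
    and record how many of them fall strictly below x.
    """
    counts = [0] * (K + 1)
    for x in xs:
        refs = [rng.choice(ys) for _ in range(K)]
        rank = sum(1 for y in refs if y < x)
        counts[rank] += 1
    n = len(xs)
    return [c / n for c in counts]

def kl_from_uniform(hist):
    """Discrete KL divergence between hist and the uniform law on its bins."""
    u = 1.0 / len(hist)
    return sum(p * math.log(p / u) for p in hist if p > 0)

rng = random.Random(0)
K = 4
nu = [rng.gauss(0, 1) for _ in range(5000)]
mu_same = [rng.gauss(0, 1) for _ in range(5000)]     # same law as nu
mu_shift = [rng.gauss(1.5, 1) for _ in range(5000)]  # shifted mean

d_same = kl_from_uniform(rank_histogram(mu_same, nu, K, rng))
d_shift = kl_from_uniform(rank_histogram(mu_shift, nu, K, rng))

print(f"same distribution:    {d_same:.4f}")
print(f"shifted distribution: {d_shift:.4f}")
```

When the two samples come from the same continuous distribution, every rank is equally likely, so the histogram is close to uniform and the divergence is near zero. A mismatch skews the ranks and the divergence grows.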


Top 20 AI Projects to Improve Your Skills in ML

digitado ⋅ April 19, 2023

In this article, we give a comprehensive overview of AI project ideas, from beginner-friendly to more advanced challenges. By working on these AI projects, you can gain valuable skills in machine learning research and development, build a portfolio, and make your contribution to the fast-developing field of artificial intelligence. One of the great things about AI is that it is a highly interdisciplinary field. There is no one-size-fits-all approach to AI projects. No matter which background you have, […]


The life of a prescription at Amazon Pharmacy

digitado ⋅ September 30, 2024

From pricing estimation and regulatory compliance to inventory management and chatbot assistants, machine learning models help Amazon Pharmacy customers stay healthy and save time and money. (Conversational AI; Alexandre Alves, Anita Vila.) Pharmacies play a vital role in ensuring patients' health, but the process of dispensing medications is far more complex than it may appear. At Amazon Pharmacy, we are using artificial […]


Domain Adaptation Without the Compute Burden for Efficient Whole Slide Image Analysis

digitado ⋅ March 18, 2026

arXiv:2603.15774v1 Announce Type: new Abstract: Computational methods on analyzing Whole Slide Images (WSIs) enable early diagnosis and treatments by supporting pathologists in detection and classification of tumors. However, the extremely high resolution of WSIs makes end-to-end training impractical compared to typical image analysis tasks. To address this, most approaches use pre-trained feature extractors to obtain fixed representations of whole slides, which are then combined with Multiple Instance Learning (MIL) for downstream tasks. These feature extractors are typically pre-trained […]


From Simulation to Deep Learning: Survey on Network Performance Modeling Approaches

digitado ⋅ March 30, 2026

Network performance modeling is a field that predates early computer networks and the beginning of the Internet. It aims to predict the traffic performance of packet flows in a given network. Its applications range from network planning and troubleshooting to feeding information to network controllers for configuration optimization. Traditional network performance modeling has relied heavily on Discrete Event Simulation (DES) and analytical methods grounded in mathematical theories such as Queuing Theory and Network Calculus. However, as of late, […]
