digitado – Page 330

Improving Machine Learning Performance with Synthetic Augmentation

digitado ⋅ 16 April 2026

Synthetic augmentation is increasingly used to mitigate data scarcity in financial machine learning, yet its statistical role remains poorly understood. We formalize synthetic augmentation as a modification of the effective training distribution and show that it induces a structural bias–variance trade-off: while additional samples may reduce estimation error, they may also shift the population objective whenever the synthetic distribution deviates from regions relevant under evaluation. To isolate informational gains from mechanical sample-size effects, we introduce a size-matched null […]

technocracy

Zero Trust in the Context of IoT: Industrial Literature Review, Trends, and Challenges

digitado ⋅ 9 April 2026

arXiv:2604.06272v1 (announce type: new). Abstract: The zero-trust (ZT) model is an increasingly popular model built on the idea that no trust should be granted to any entity (network, person, device) by default. The ZT model is gaining attention from both research and practice, with varying degrees of alignment between the research developed and real-life applications. NIST has provided a standard to fulfill the requirements of a ZT architecture for the network core, but many practical aspects remain unspecified, some of them requiring solving […]

technocracy

Measuring Inclusion in Interaction: Inclusion Analytics for Human-AI Collaborative Learning

digitado ⋅ 11 February 2026

arXiv:2602.09269v1 (announce type: new). Abstract: Inclusion, equity, and access are widely valued in AI and education, yet are often assessed through coarse sample descriptors or post-hoc self-reports that miss how inclusion is shaped moment by moment in collaborative problem solving (CPS). In this proof-of-concept paper, we introduce inclusion analytics, a discourse-based framework for examining inclusion as a dynamic, interactional process in CPS. We conceptualize inclusion along three complementary dimensions (participation equity, affective climate, and epistemic equity) […]

technocracy

ContextCite: Attributing Model Generation to Context

digitado ⋅ 6 May 2024

Language models may need external information to provide a response to a given query. A user would provide this information to a language model as context and then expect the model to interact with this context when responding to the query. For example, suppose that I want to use an AI assistant like ChatGPT to help me plan a trip to see a solar eclipse this week. I would first […]

technocracy

Notes on the Reward Representation of Posterior Updates

digitado ⋅ 4 February 2026

arXiv:2602.02912v1 (announce type: cross). Abstract: Many ideas in modern control and reinforcement learning treat decision-making as inference: start from a baseline distribution and update it when a signal arrives. We ask when this can be made literal rather than metaphorical. We study the special case where a KL-regularized soft update is exactly a Bayesian posterior inside a single fixed probabilistic model, so the update variable is a genuine channel through which information is transmitted. In this regime, behavioral […]

technocracy

Developing an LLM: Building, Training, Finetuning

digitado ⋅ 2 June 2024

This one-hour talk gives an overview of the LLM development process, focusing on its three essential stages: coding the architecture, implementing pretraining, and fine-tuning the LLM. Lastly, we also discuss the main ways LLMs are evaluated, along with the caveats of each method.

technocracy

MARS: Unleashing the Power of Speculative Decoding via Margin-Aware Verification

digitado ⋅ 23 January 2026

arXiv:2601.15498v1 (announce type: new). Abstract: Speculative Decoding (SD) accelerates autoregressive large language model (LLM) inference by decoupling generation and verification. While recent methods improve draft quality by tightly coupling the drafter with the target model, the verification mechanism itself remains largely unchanged, relying on strict token-level rejection sampling. In practice, modern LLMs frequently operate in low-margin regimes where the target model exhibits weak preference among top candidates. In such cases, rejecting plausible runner-up tokens yields negligible information gain […]

technocracy

Contact-Grounded Policy: Dexterous Visuotactile Policy with Generative Contact Grounding

digitado ⋅ 9 March 2026

arXiv:2603.05687v1 (announce type: new). Abstract: Contact-Grounded Policy (CGP) enables fine-grained, contact-rich dexterous manipulation by grounding multi-point contacts through predicting the actual robot state and tactile feedback, and by using a learned contact-consistency mapping to convert these predictions into controller-executable targets for a compliance controller. CGP supports both dense tactile arrays and vision-based tactile sensors mounted on the hand. We collect demonstrations via teleoperation in both simulation and on a physical robot, and evaluate CGP across multiple dexterous manipulation […]

technocracy

Samsung Galaxy S26 Ultra review: Private and performant

digitado ⋅ 17 March 2026

Samsung is nothing if not consistent. Just as it has for many years, the company is starting the year with a new generation of Galaxy S phones. Rumors about remixing the lineup did not pan out, so there are still three versions of the phone—the Galaxy S26, S26 Plus, and S26 Ultra. It’s the Ultra, with its whopping $1,300 price tag, that makes up the largest chunk of Samsung flagship sales, even though you can get a perfectly […]

technocracy

Real-Time Sports Action Recognition Using a CNN–Transformer Hybrid Deep Learning Framework

digitado ⋅ 2 April 2026

The rapid expansion of sports broadcasting and digital media platforms has increased the demand for intelligent systems capable of automatically identifying important sports events for real-time analytics and highlight generation. Manual annotation of sports videos requires significant time and effort and may introduce human errors during analysis. This paper presents a real-time sports action recognition framework using a hybrid CNN–Transformer architecture for detecting critical events in football and cricket videos. The proposed system processes live or recorded video […]
