March 2021 – digitado

When are Neural Networks more powerful than Neural Tangent Kernels?

digitado ⋅ 25 de March de 2021

The empirical success of deep learning has posed significant challenges to machine learning theory: Why can we efficiently train neural networks with gradient descent despite its highly non-convex optimization landscape? Why do over-parametrized networks generalize well? The recently proposed Neural Tangent Kernel (NTK) theory offers a powerful framework for understanding these, but yet still comes with its limitations. In this blog post, we explore how to analyze wide neural networks beyond the NTK theory, based on our recent […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond log-concave sampling (Part 3)

digitado ⋅ 12 de March de 2021

In the first post of this series, we introduced the challenges of sampling distributions beyond log-concavity. In Part 2 we tackled sampling from multimodal distributions: a typical obstacle occuring in problems involving statistical inference and posterior sampling in generative models. In this (final) post of the series, we consider sampling in the presence of manifold structure in the level sets of the distribution – which also frequently manifests in the same settings. It will cover the paper Fast […]

Ver mais

Like 0

Liked Liked

technocracy

Beyond log-concave sampling (Part 2)

digitado ⋅ 1 de March de 2021

In our previous blog post, we introduced the challenges of sampling distributions beyond log-concavity. We first introduced the problem of sampling from a distibution $p(x) propto e^{-f(x)}$ given value or gradient oracle access to $f$, as an analogous problem to black-box optimization with oracle access. We introduced the natural algorithm for sampling in this setup: Langevin Monte Carlo, a Markov Chain reminiscent of noisy gradient descent, [x_{t+eta} = x_t – eta nabla f(x_t) + sqrt{2eta}xi_t,quad xi_tsim N(0,I).] Finally, […]

Ver mais

Like 0

Liked Liked