Any implementations similar to D4RT? [D]

digitado ⋅ May 10, 2026

DeepMind released a paper on D4RT at the start of this year that enables a “4D” understanding of the world via structure from motion, producing:
1. Point cloud reconstruction from 2D videos of dynamic (not static) scenes
2. Camera pose estimation

You could pass in a video of a dog walking on a beach, and it would estimate a 3D representation of the beach and the dog at any point in time.
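
To make the two outputs concrete, here is a minimal sketch of how a per-frame depth map and a camera pose combine into a world-space point cloud. This is not D4RT's method, just standard pinhole back-projection. The function name `backproject_to_world`, the intrinsics `K`, and the `cam_to_world` pose matrix are all hypothetical names chosen for illustration, and the toy values are made up.

```python
import numpy as np

def backproject_to_world(depth, K, cam_to_world):
    """Lift a depth map (H, W) to world-space points (H*W, 3).

    Assumes a pinhole camera with intrinsics K (3x3) and a 4x4
    camera-to-world pose; depth is measured along the camera z-axis.
    """
    h, w = depth.shape
    # u indexes columns (x), v indexes rows (y); both have shape (H, W).
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pixels = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).astype(float)
    # Rays in camera space with z = 1, one per pixel.
    rays = pixels @ np.linalg.inv(K).T
    # Scale each ray by its depth to get camera-space points.
    points_cam = rays * depth.reshape(-1, 1)
    # Apply the camera pose (rotation, then translation).
    R, t = cam_to_world[:3, :3], cam_to_world[:3, 3]
    return points_cam @ R.T + t

# Toy example: a 2x2 frame at constant depth 2, camera shifted 1 unit along x.
depth = np.full((2, 2), 2.0)
K = np.array([[2.0, 0.0, 0.5],
              [0.0, 2.0, 0.5],
              [0.0, 0.0, 1.0]])
pose = np.eye(4)
pose[:3, 3] = [1.0, 0.0, 0.0]
points = backproject_to_world(depth, K, pose)
```

Running this once per frame, each with its own depth map and pose, would give a point cloud per timestamp, which is roughly the kind of output described above.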

They did not release the model, though. Are there any open-source implementations of anything similar available now?

submitted by /u/reddysteady