Official implementation of the paper "The Stable Signature Rooting Watermarks in Latent Diffusion Models"
Dataset for the paper UmeTrack Unified multi-view end-to-end hand tracking for VR
This is the official implementation of CT2Hair High-fidelity 3D Hair Modeling Using Computed Tomography.
This is the repository for TimelineQA, a benchmark for querying lifelogs.
Code for “Pretrained Language Models as Visual Planners for Human Assistance”
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation
Official implementation of "Decentralization and Acceleration Enables Large-Scale Bundle Adjustment"
AssemblyHands Toolkit is a Python package that provides data loader, visualization, and evaluation tools for the AssemblyHands dataset (CVPR 2023).
code for paper "Accessing higher dimensions for unsupervised word translation"
The repository for the project A Replication Study of Compositional Generalization Works on Semantic Parsing, accepted into MLRC2022.
Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Code release for "Improved baselines for vision-language pre-training"
Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)
OpenSFEDS, a near-eye gaze estimation dataset containing approximately 2M synthetic camera-photosensor image pairs sampled at 500 Hz under varied appearance and camera position.
PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering queries that require computing aggregates over your data.