FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation Paper • 2312.04484 • Published Dec 7, 2023
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes Paper • 2501.04004 • Published 28 days ago • 1
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published 28 days ago • 25
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Paper • 2501.04005 • Published 28 days ago
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Paper • 2501.00326 • Published Dec 31, 2024 • 1
FlexEvent: Event Camera Object Detection at Arbitrary Frequencies Paper • 2412.06708 • Published Dec 9, 2024
Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective Paper • 2208.07365 • Published Aug 15, 2022
4D Contrastive Superflows are Dense 3D Representation Learners Paper • 2407.06190 • Published Jul 8, 2024
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding Paper • 2412.04383 • Published Dec 5, 2024 • 4
TinyFusion: Diffusion Transformers Learned Shallow Paper • 2412.01199 • Published Dec 2, 2024 • 14
OminiControl: Minimal and Universal Control for Diffusion Transformer Paper • 2411.15098 • Published Nov 22, 2024 • 55
DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes Paper • 2410.18084 • Published Oct 23, 2024 • 13
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper • 2409.17481 • Published Sep 26, 2024 • 47
Vista3D: Unravel the 3D Darkside of a Single Image Paper • 2409.12193 • Published Sep 18, 2024 • 10
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally Paper • 2409.08270 • Published Sep 12, 2024 • 10
PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis Paper • 2408.09481 • Published Aug 18, 2024 • 1
Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision Paper • 2309.14181 • Published Sep 25, 2023 • 2
Towards Explainable In-the-Wild Video Quality Assessment: A Database and a Language-Prompted Approach Paper • 2305.12726 • Published May 22, 2023