431 to 440 of 3,261 Results
Sep 27, 2024 - S-Lab for Advanced Intelligence
Wu, Tianxing; Si, Chenyang; Jiang, Yuming; Huang, Ziqi; Liu, Ziwei, 2024, "FreeInit: Bridging Initialization Gap in Video Diffusion Models", https://doi.org/10.21979/N9/JMCW1W, DR-NTU (Data), V1
Though diffusion-based video generation has witnessed rapid progress, the inference results of existing models still exhibit unsatisfactory temporal consistency and unnatural dynamics. In this paper, we delve deep into the noise initialization of video diffusion models, and disco... |
Sep 27, 2024 - S-Lab for Advanced Intelligence
Lan, Yushi; Fangzhou Hong; Shuai Yang; Shangchen Zhou; Bo Dai; Xingang Pan; Chen Change Loy, 2024, "LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation", https://doi.org/10.21979/N9/UZ06ZG, DR-NTU (Data), V1
The field of neural rendering has witnessed significant progress with advancements in generative models and differentiable rendering techniques. Though 2D diffusion has achieved success, a unified 3D diffusion pipeline remains unsettled. This paper introduces a novel framework ca... |
Sep 27, 2024 - S-Lab for Advanced Intelligence
Chen, Yongwei; Wang, Tengfei; Wu, Tong; Pan, Xingang; Jia, Kui; Liu, Ziwei, 2024, "ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance", https://doi.org/10.21979/N9/BAZCX6, DR-NTU (Data), V1
Generating high-quality 3D assets from a given image is highly desirable in various applications such as AR/VR. Recent advances in single-image 3D generation explore feed-forward models that learn to infer the 3D model of an object without optimization. Though promising results h... |
Sep 27, 2024 - NIE Data Repository (Harvested)
Rastogi, Rachika, 2025, "Related Data for Thesis/Dissertation: A comparative multimodal analysis of environmental ideologies in two contemporary picturebooks", https://doi.org/10.25340/R4/DXVWY6
Against the backdrop of the existential global environmental crisis and the ambitious targets outlined by the UN Sustainable Development Goals (SDGs), this study investigates the critical significance of picturebooks in shaping childhood understandings of human-nature relationshi...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Sep 26, 2024 - S-Lab for Advanced Intelligence
Tang, Jiaxiang; Chen, Zhaoxi; Chen, Xiaokang; Wang, Tengfei; Zeng, Gang; Liu, Ziwei, 2024, "LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation", https://doi.org/10.21979/N9/27JLJB, DR-NTU (Data), V1
3D content creation has achieved significant progress in terms of both quality and speed. Although current feed-forward models can produce 3D objects in seconds, their resolution is constrained by the intensive computation required during training. In this paper, we introduce Lar... |
Sep 25, 2024 - S-Lab for Advanced Intelligence
Lan, Mengcheng; Chen, Chaofeng; Ke, Yiping; Wang, Xinjiang; Feng, Litong; Zhang, Wayne, 2024, "ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation", https://doi.org/10.21979/N9/YY8L5O, DR-NTU (Data), V1
Open-vocabulary semantic segmentation requires models to effectively integrate visual representations with open-vocabulary semantic labels. While Contrastive Language-Image Pre-training (CLIP) models shine in recognizing visual concepts from text, they often struggle with segment... |
Sep 25, 2024 - S-Lab for Advanced Intelligence
Lan, Mengcheng; Chen, Chaofeng; Ke, Yiping; Wang, Xinjiang; Feng, Litong; Zhang, Wayne, 2024, "ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference", https://doi.org/10.21979/N9/S6NTDJ, DR-NTU (Data), V1
Despite the success of large-scale pretrained Vision-Language Models (VLMs) especially CLIP in various open-vocabulary tasks, their application to semantic segmentation remains challenging, producing noisy segmentation maps with mis-segmented regions. In this paper, we carefully... |
Sep 25, 2024 - XU Bowen
New, T. H.; Xu, Bowen; Shi, Shengxian, 2024, "Replication Data for: Collisions of vortex rings with hemispheres", https://doi.org/10.21979/N9/0MLY8J, DR-NTU (Data), V1
The data support the findings in the paper of "Collisions of vortex rings with hemispheres". |
Sep 25, 2024 - S-Lab for Advanced Intelligence
Yuan, Haobo; Li, Xiangtai; Zhou, Chong; Li, Yining; Chen, Kai; Loy, Chen Change, 2024, "Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively", https://doi.org/10.21979/N9/L05ULT, DR-NTU (Data), V1
The CLIP and Segment Anything Model (SAM) are remarkable vision foundation models (VFMs). SAM excels in segmentation tasks across diverse domains, whereas CLIP is renowned for its zero-shot recognition capabilities. This paper presents an in-depth exploration of integrating these... |
Sep 25, 2024 - S-Lab for Advanced Intelligence
Wu, Tianhao; Zheng, Chuanxia; Wu, Qianyi; Cham, Tat-Jen, 2024, "ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition", https://doi.org/10.21979/N9/RJUHMC, DR-NTU (Data), V1
3D decomposition/segmentation remains a challenge as large-scale 3D annotated data is not readily available. Existing approaches typically leverage 2D machine-generated segments, integrating them to achieve 3D consistency. In this paper, we propose ClusteringSDF, a novel approach... |
