S-Lab for Advanced Intelligence, established in 2020, is a university laboratory at NTU focusing on research and development of cutting-edge AI technologies in computer vision, natural language processing, reinforcement learning, deep learning, and distributed computing. We aim to create impactful applications spanning various strategic areas in partnership with academic, industry, and government organizations.
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

1 to 10 of 85 Results
Apr 28, 2026
Xu, Yuanmu; Hou, Guanli; Hu, Jiangbei; Ren, Tenglong; Wang, Xiaokun; Zhang, Yalan; Ban, Xiaojuan; Qian, Chen; Hou, Fei; He, Ying, 2025, "PGA-NeuS: Physics and Geometry-Augmented Neural Implicit Surfaces for Rigid Bodies", https://doi.org/10.21979/N9/LTXKFL, DR-NTU (Data), V2
This paper tackles the challenges of physics-based simulation of rigid bodies in neural rendering, focusing on 3D model representation and collision handling. A synthetic and real-world dataset is also included in the paper.
Compressed Archive - 462.9 MB - MD5: bf5921041d638ae5ea5bbc8d236344b6
Oct 7, 2025
Zhang, Yuanhan; Chew, Yunice; Dong, Yuhao; Leo, Aria; Hu, Bo; Liu, Ziwei, 2025, "Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding", https://doi.org/10.21979/N9/KTBVSQ, DR-NTU (Data), V1
We introduce the Video Thinking Test (Video-TT), a benchmark designed to assess if video LLMs can interpret real-world videos as effectively as humans. Video-TT 1) differentiates between errors due to inadequate frame sampling and genuine gaps in understanding complex visual narr...
Sep 17, 2025
Li, Ruibo; Shi, Hanyu; Wang, Zhe; Lin, Guosheng, 2025, "Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving", https://doi.org/10.21979/N9/PE8MLE, DR-NTU (Data), V1
Understanding motion in dynamic environments is critical for autonomous driving, thereby motivating research on class-agnostic motion prediction. In this work, we investigate weakly and self-supervised class-agnostic motion prediction from LiDAR point clouds. Outdoor scenes typic...
Sep 11, 2025
Dai, Yuekun; Li, Haitian; Zhou, Shangchen; Loy, Chen Change, 2025, "Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting", https://doi.org/10.21979/N9/4NI0GT, DR-NTU (Data), V1
RGBA images, with the additional alpha channel, are crucial for any application that needs blending, masking, or transparency effects, making them more versatile than standard RGB images. Nevertheless, existing image inpainting methods are designed exclusively for RGB images. Con...
ZIP Archive - 654.6 MB - MD5: 972cfdcaec94978b658a7296d3bc0dbb
Benchmark of our paper.
Sep 10, 2025
Xie, Haozhe; Chen, Zhaoxi; Hong, Fangzhou; Liu, Ziwei, 2025, "Compositional Generative Model of Unbounded 4D Cities", https://doi.org/10.21979/N9/CHQPCL, DR-NTU (Data), V1
3D scene generation has garnered growing attention in recent years and has made significant progress. Generating 4D cities is more challenging than 3D scenes due to the presence of structurally complex, visually diverse objects like buildings and vehicles, and heightened human se...
Sep 4, 2025
Li, Xiaoming; Zuo, Wangmeng; Loy, Chen Change, 2025, "Enhanced Generative Structure Prior for Chinese Text Image Super-Resolution", https://doi.org/10.21979/N9/DTZDDZ, DR-NTU (Data), V1
Faithful text image super-resolution (SR) is challenging because each character has a unique structure and usually exhibits diverse font styles and layouts. While existing methods primarily focus on English text, less attention has been paid to more complex scripts like Chinese....
Jun 12, 2025
Wu, Size; Jin, Sheng; Zhang, Wenwei; Xu, Lumin; Liu, Wentao; Li, Wei; Loy, Chen Change, 2025, "F-LMM: Grounding Frozen Large Multimodal Models", https://doi.org/10.21979/N9/M0U5AV, DR-NTU (Data), V1
Endowing Large Multimodal Models (LMMs) with visual grounding capability can significantly enhance AIs’ understanding of the visual world and their interaction with humans. However, existing methods typically fine-tune the parameters of LMMs to learn additional segmentation token...
Jun 5, 2025
Liao, Kang; Yue, Zongsheng; Wu, Zhonghua; Loy, Chen Change, 2025, "MOWA: Multiple-in-One Image Warping Model", https://doi.org/10.21979/N9/ZPPMT8, DR-NTU (Data), V1
While recent image warping approaches achieved remarkable success on existing benchmarks, they still require training separate models for each specific task and cannot generalize well to different camera models or customized manipulations. To address diverse types of warping in p...
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.