Nov 25, 2024 - S-Lab for Advanced Intelligence
Ouyang, Wenqi; Dong, Yi; Yang, Lei; Si, Jianlou; Pan, Xingang, 2024, "I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models", https://doi.org/10.21979/N9/2ZLRYG, DR-NTU (Data), V1
The remarkable generative capabilities of diffusion models have motivated extensive research in both image and video editing. Compared to video editing which faces additional challenges in the time dimension, image editing has witnessed the development of more diverse, high-quali...
Nov 15, 2024 - Narendra VISHWAKARMA
Vishwakarma, Narendra; Swaminathan, R.; Diamantoulakis, Panagiotis D.; Karagiannidis, George K., 2024, "Related Data for: Cascaded FSO systems with optical reflecting surfaces", https://doi.org/10.21979/N9/WKU9JA, DR-NTU (Data), V1
MATLAB and Python source code for the publication titled "Cascaded FSO systems with optical reflecting surfaces". This code produces the outage probability and bit error rate plots for the paper.
Nov 7, 2024 - S-Lab for Advanced Intelligence
Xiao, Zeqi; Zhou, Yifan; Yang, Shuai; Pan, Xingang, 2024, "Video Diffusion Models are Training-free Motion Interpreter and Controller", https://doi.org/10.21979/N9/HQM313, DR-NTU (Data), V1
Video generation primarily aims to model authentic and customized motion across frames, making understanding and controlling the motion a crucial topic. Most diffusion-based studies on video motion focus on motion customization with training-based paradigms, which, however, deman...
Oct 23, 2024 - S-Lab for Advanced Intelligence
Jiang, Xueying; Jin, Sheng; Zhang, Xiaoqin; Shao, Ling; Lu, Shijian, 2024, "MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders", https://doi.org/10.21979/N9/5ILJOM, DR-NTU (Data), V1
Monocular 3D object detection aims for precise 3D localization and identification of objects from a single-view image. Despite its recent progress, it often struggles while handling pervasive object occlusions that tend to complicate and degrade the prediction of object dimension...
Oct 9, 2024 - Kai Keng ANG
Premchand, Brian; Liang, Liyuan; Kok Soon, Phua; Zhang, Zhuo; Wang, Chuanchu; Guo, Ling; Ang, Jennifer; Koh, Juliana; Yong, Xueyi; Ang, Kai Keng, 2024, "Related Data for: Wearable EEG-Based Brain–Computer Interface for Stress Monitoring", https://doi.org/10.21979/N9/ZJM6WF, DR-NTU (Data), V1
The dataset comprises EEG and ECG data collected from 40 subjects performing MMIT and CVT tasks, as described in the paper.
Oct 8, 2024 - S-Lab for Advanced Intelligence
Huang, Ziqi; Wu, Tianxing; Jiang, Yuming; Chan, Kelvin C. K.; Liu, Ziwei, 2024, "Replication Data for: ReVersion: Diffusion-Based Relation Inversion from Images", https://doi.org/10.21979/N9/UWSAXU, DR-NTU (Data), V1
A replication of the ReVersion Benchmark for the paper "ReVersion: Diffusion-Based Relation Inversion from Images".
Oct 8, 2024 - S-Lab for Advanced Intelligence
Xie, Binzhu; Zhang, Sicheng; Zhou, Zitang; Li, Bo; Zhang, Yuanhan; Hessel, Jack; Yang, Jingkang; Liu, Ziwei, 2024, "FunQA: Towards Surprising Video Comprehension", https://doi.org/10.21979/N9/SMR703, DR-NTU (Data), V1
Surprising videos, e.g., funny clips, creative performances, or visual illusions, attract significant attention. Enjoyment of these videos is not simply a response to visual stimuli; rather, it hinges on the human capacity to understand (and appreciate) commonsense violations dep...
Oct 8, 2024 - S-Lab for Advanced Intelligence
Yang, Jingkang; Dong, Yuhao; Liu, Shuai; Li, Bo; Wang, Ziyue; Jiang, Chencheng; Tan, Haoran; Kang, Jiamu; Zhang, Yuanhan; Zhou, Kaiyang; Liu, Ziwei, 2024, "Octopus: Embodied Vision-Language Programmer from Environmental Feedback", https://doi.org/10.21979/N9/9EIB8X, DR-NTU (Data), V1
Large vision-language models (VLMs) have achieved substantial progress in multimodal perception and reasoning. Furthermore, when seamlessly integrated into an embodied agent, it signifies a crucial stride towards the creation of autonomous and context-aware systems capable of for...
Oct 7, 2024 - S-Lab for Advanced Intelligence
Ma, Yubo; Zang, Yuhang; Chan, Liangyu; Chen, Meiqi; Jiao, Yizhu; Li, Xinze; Lu, Xinyuan; Liu, Ziyu; Ma, Yan; Dong, Xiaoyi; Zhang, Pan; Pan, Liangming; Jiang, Yu-Gang; Wang, Jiaqi; Cao, Yixin; Sun, Aixin, 2024, "Replication Data for: MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations", https://doi.org/10.21979/N9/IMVWT4, DR-NTU (Data), V1
Understanding documents with rich layouts and multi-modal components is a long-standing and practical task. Recent Large Vision-Language Models (LVLMs) have made remarkable strides in various tasks, particularly in single-page document understanding (DU). However, their abilities...
Oct 5, 2024 - Narendra VISHWAKARMA
Vishwakarma, Narendra; Swaminathan, R.; Premanand, Rithwik; Sharma, Shubha; Madhukumar, A. S., 2024, "Related Data for: RIS-assisted hybrid FSO/THz system with diversity combining schemes: A performance analysis", https://doi.org/10.21979/N9/A7QMG1, DR-NTU (Data), V1
MATLAB source code for the publication titled "RIS-assisted hybrid FSO/THz system with diversity combining schemes: A performance analysis". This code produces the outage probability and bit error rate plots, with asymptotic curves, for the paper.
