Nov 15, 2024 - Narendra VISHWAKARMA
Vishwakarma, Narendra; Swaminathan, R.; Diamantoulakis, Panagiotis D.; Karagiannidis, George K., 2024, "Related Data for: Cascaded FSO systems with optical reflecting surfaces", https://doi.org/10.21979/N9/WKU9JA, DR-NTU (Data), V1
MATLAB and Python source code for the publication titled "Cascaded FSO systems with optical reflecting surfaces". This code produces the outage probability and bit error rate plots for that paper. |
Nov 7, 2024 - S-Lab for Advanced Intelligence
Xiao, Zeqi; Zhou, Yifan; Yang, Shuai; Pan, Xingang, 2024, "Video Diffusion Models are Training-free Motion Interpreter and Controller", https://doi.org/10.21979/N9/HQM313, DR-NTU (Data), V1
Video generation primarily aims to model authentic and customized motion across frames, making understanding and controlling the motion a crucial topic. Most diffusion-based studies on video motion focus on motion customization with training-based paradigms, which, however, deman... |
Oct 23, 2024 - S-Lab for Advanced Intelligence
Jiang, Xueying; Jin, Sheng; Zhang, Xiaoqin; Shao, Ling; Lu, Shijian, 2024, "MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders", https://doi.org/10.21979/N9/5ILJOM, DR-NTU (Data), V1
Monocular 3D object detection aims for precise 3D localization and identification of objects from a single-view image. Despite its recent progress, it often struggles while handling pervasive object occlusions that tend to complicate and degrade the prediction of object dimension... |
Oct 9, 2024 - Kai Keng ANG
Premchand, Brian; Liang, Liyuan; Kok Soon, Phua; Zhang, Zhuo; Wang, Chuanchu; Guo, Ling; Ang, Jennifer; Koh, Juliana; Yong, Xueyi; Ang, Kai Keng, 2024, "Related Data for: Wearable EEG-Based Brain–Computer Interface for Stress Monitoring", https://doi.org/10.21979/N9/ZJM6WF, DR-NTU (Data), V1
The dataset comprises EEG and ECG data collected from 40 subjects performing the MMIT and CVT tasks, as described in the paper. |
Oct 9, 2024
Appointment: Adjunct Associate Professor |
Oct 8, 2024 - S-Lab for Advanced Intelligence
Huang, Ziqi; Wu, Tianxing; Jiang, Yuming; Chan, Kelvin C. K.; Liu, Ziwei, 2024, "Replication Data for: ReVersion: Diffusion-Based Relation Inversion from Images", https://doi.org/10.21979/N9/UWSAXU, DR-NTU (Data), V1
A replication of the ReVersion Benchmark, for the paper "ReVersion: Diffusion-Based Relation Inversion from Images". |
Oct 8, 2024 - S-Lab for Advanced Intelligence
Xie, Binzhu; Zhang, Sicheng; Zhou, Zitang; Li, Bo; Zhang, Yuanhan; Hessel, Jack; Yang, Jingkang; Liu, Ziwei, 2024, "FunQA: Towards Surprising Video Comprehension", https://doi.org/10.21979/N9/SMR703, DR-NTU (Data), V1
Surprising videos, e.g., funny clips, creative performances, or visual illusions, attract significant attention. Enjoyment of these videos is not simply a response to visual stimuli; rather, it hinges on the human capacity to understand (and appreciate) commonsense violations dep... |
Oct 8, 2024 - S-Lab for Advanced Intelligence
Yang, Jingkang; Dong, Yuhao; Liu, Shuai; Li, Bo; Wang, Ziyue; Jiang, Chencheng; Tan, Haoran; Kang, Jiamu; Zhang, Yuanhan; Zhou, Kaiyang; Liu, Ziwei, 2024, "Octopus: Embodied Vision-Language Programmer from Environmental Feedback", https://doi.org/10.21979/N9/9EIB8X, DR-NTU (Data), V1
Large vision-language models (VLMs) have achieved substantial progress in multimodal perception and reasoning. Furthermore, when seamlessly integrated into an embodied agent, it signifies a crucial stride towards the creation of autonomous and context-aware systems capable of for... |
Oct 7, 2024 - S-Lab for Advanced Intelligence
Ma, Yubo; Zang, Yuhang; Chan, Liangyu; Chen, Meiqi; Jiao, Yizhu; Li, Xinze; Lu, Xinyuan; Liu, Ziyu; Ma, Yan; Dong, Xiaoyi; Zhang, Pan; Pan, Liangming; Jiang, Yu-Gang; Wang, Jiaqi; Cao, Yixin; Sun, Aixin, 2024, "Replication Data for: MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations", https://doi.org/10.21979/N9/IMVWT4, DR-NTU (Data), V1
Understanding documents with rich layouts and multi-modal components is a long-standing and practical task. Recent Large Vision-Language Models (LVLMs) have made remarkable strides in various tasks, particularly in single-page document understanding (DU). However, their abilities... |
Oct 5, 2024 - Narendra VISHWAKARMA
Vishwakarma, Narendra; R., Swaminathan; Premanand, Rithwik; Sharma, Shubha; Madhukumar, A. S., 2024, "Related Data for: RIS-assisted hybrid FSO/THz system with diversity combining schemes: A performance analysis", https://doi.org/10.21979/N9/A7QMG1, DR-NTU (Data), V1
MATLAB source code for the publication titled "RIS-assisted hybrid FSO/THz system with diversity combining schemes: A performance analysis". This code produces the outage probability and bit error rate plots, along with the asymptotic plots, for that paper. |