Seungwook Kim

Computer Vision & Robotics Researcher

You can also call me Wookie. I received my integrated MSc/PhD from the POSTECH Computer Vision Lab, advised by Prof. Minsu Cho. My research spans interactive and efficient world models and VLAs (ongoing), {text,image}-to-{image,3D,video} generation (CorrespondentDream, RapidMV, FreeAction, SOLACE), visual correspondence (TransforMatcher, HCCNet, MambaMatcher, DHVR), and 2D/3D equivariance (SeLCA, CHOIR, RIST).
swookie.kim (at) gmail (dot) com
wookiekim (at) postech (dot) ac (dot) kr

News

Publications & Projects

Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards

Seungwook Kim, Minsu Cho

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2026

FreeAction: Training-Free Techniques for Enhanced Fidelity of Trajectory-to-Video Generation

Seungwook Kim, Seunghyeon Lee, Minsu Cho

2025 Conference on Robot Learning @ Seoul, South Korea - Workshop on Learning to Simulate Robot Worlds

RapidMV: Leveraging Spatio-Angular Representations for Efficient and Consistent Text-to-Multi-View Synthesis

Seungwook Kim, Yichun Shi, Kejie Li, Minsu Cho, Peng Wang

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026 @ Arizona

Similarity-Aware Selective State-Space Modeling for Semantic Correspondence

Seungwook Kim, Minsu Cho

The IEEE / CVF International Conference on Computer Vision (ICCV) 2025 @ Honolulu, Hawaii - Findings (Oral)

Harnessing the Power of Training-Free Techniques for Text-to-3D Generation via Score Distillation Sampling

Junhong Lee, Seungwook Kim, Minsu Cho

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2025 @ Nashville - Workshop on AI4CC

3D Geometric Shape Assembly via Efficient Point Cloud Matching

Nahyuk Lee*, Juhong Min*, Junha Lee, Seungwook Kim, Kanghee Lee, Jaesik Park, Minsu Cho

The 41st International Conference on Machine Learning (ICML) 2024 @ Vienna

Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation

Seungwook Kim, Yichun Shi, Kejie Li, Minsu Cho, Peng Wang

arXiv 2024

Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Seungwook Kim, Kejie Li, Xueqing Deng, Yichun Shi, Minsu Cho, Peng Wang

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2024 @ Seattle

Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform

Chunghyun Park*, Seungwook Kim*, Jaesik Park, Minsu Cho

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2024 @ Seattle

Efficient Semantic Matching with Hypercolumn Correlation

Seungwook Kim, Juhong Min, Minsu Cho

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024 @ Big Island

Stable and Consistent Prediction of 3D Characteristic Orientation via Invariant Residual Learning

Seungwook Kim*, Chunghyun Park, Yoonwoo Jeong, Jaesik Park, Minsu Cho

International Conference on Machine Learning (ICML) 2023 @ Honolulu

Learning Rotation-Equivariant Features for Visual Correspondence

Jongmin Lee, Byungjin Kim, Seungwook Kim, Minsu Cho

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023 @ Vancouver

Convolutional Hough Matching Networks for Robust and Efficient Visual Correspondence

Juhong Min, Seungwook Kim, Minsu Cho

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2023

SeLCA: Self-Supervised Learning of Canonical Axis

Seungwook Kim, Yoonwoo Jeong, Chunghyun Park, Jaesik Park, Minsu Cho

Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS) 2022 Workshop - Symmetry and Geometry in Neural Representations @ New Orleans

TransforMatcher: Match-to-Match Attention for Semantic Correspondence

Seungwook Kim, Juhong Min, Minsu Cho

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2022 @ New Orleans

Deep Hough Voting for Robust Global Registration

Junha Lee, Seungwook Kim, Minsu Cho, Jaesik Park

The IEEE / CVF International Conference on Computer Vision (ICCV) 2021 @ Montreal

Learning to Distill Convolutional Features into Compact Local Descriptors

Jongmin Lee, Yoonwoo Jeong, Seungwook Kim, Juhong Min, Minsu Cho

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2021 @ Hawaii

Honors & Awards

Experiences

PhD Intern / Bytedance Seed Team

06.2024 - 12.2024

ByteDance, CA, USA

Worked on improving the efficiency and consistency of {image,text}-to-multiview generation models. Published one paper: RapidMV (WACV 2026).

PhD Intern / Data-Intelligent Creation-Vision and Graphics team

09.2023 - 02.2024

ByteDance, CA, USA

Worked on improving {image,text}-to-3D generation models. Published two papers: CorrespondentDream (CVPR 2024) and MultiImageDream (arXiv 2024).

Technical Chair

05.2024 - 12.2024

ACCV, Hanoi, Vietnam

Managed Microsoft CMT to coordinate the paper submission and review process.

Undergraduate Intern / 3D Map construction from LiDAR

03.2020 - 07.2020

Polaris3D, Pohang, South Korea

Implemented real-time data retrieval from Intel RealSense cameras to a Jetson Nano. Merged the two data streams, captured from different angles, to output a 3D map in real time.

Undergraduate Intern / Camera ISP

12.2019 - 01.2020

SK Hynix, Icheon, South Korea

Analyzed the image post-processing algorithms applied to raw images obtained from sensors.

Undergraduate Intern / AI team

06.2019 - 08.2019

Netmarble, Seoul, South Korea

Reworked an existing speech-to-3D lip synthesis pipeline in TensorFlow to make it lightweight enough to run on mobile devices.

Undergraduate Intern / Data Engineering & Analysis team

02.2018 - 12.2018

Dable, Seoul, South Korea

Analyzed high-traffic raw data stored in Amazon Redshift using PostgreSQL. Developed batch-processing code and a web crawling system.

Education

Integrated MSc/PhD in Graduate School of AI

09.2020 - 02.2026

POSTECH

Advisor: Prof. Minsu Cho

BSc in Computer Science & Engineering

2015 - 2020

POSTECH

GPA: 3.70 / 4.3