제8회 전자공학회 신호처리소사이어티 영상이해연구회 여름학교

강연요약

강연자	강연 내용
민동보 교수 (이화여자대학교)	Large scale outdoor RGB+D dataset and its application to single image depth estimation Obtaining 3D depth of a scene is essential to alleviate a number of challenges in computer vision tasks. Recently, monocular depth estimation that aims to estimate the depth map from a single image has been studied actively thanks to the widespread of deep neural networks. This talk introduces our recent works for the monocular depth estimation, including constructing a large-scale RGB+D dataset and novel convolutional neural networks based algorithms. I’ll first present the massive RGB+D dataset, called DIML/CVL dataset, which provides 1M color images and associated depth maps taken in outdoor environments. This includes several pre-processing algorithms and detailed performance analysis of our dataset. Then, I’ll introduce a novel approach for monocular depth estimation.
김창익 교수 (KAIST)	그래프 신호처리 및 그래프기반 딥러닝 (Graph Signal Processing and graph CNNs) 그래프 신호처리는 음성이나 영상신호와 같이 규칙적인 어레이 상에 정의되는 신호와 달리 비규칙적인 그래프 영역에서 정의되는 신호를 다루는 기술이다. 이 강의에서, 먼저 그래프 신호처리의 기본 개념과 전통적인 디지탈 신호처리 기법들과의 관련성을 알아 본다. 그리고 나서 Spectral graph 이론에 대해 자세히 설명하고, 이 이론을 기반으로 graph image filtering (denoising), spectral clustering 및 non-linear dimension reduction 등의 작업을 수행하는 원리를 설명한다. 끝으로, 이 이론이 그래프기반 딥 러닝에 적용되는 예를 설명하여 기존의 자연언어 또는 영상 형태의 데이터만이 아닌 소셜 네트워크 데이터, 통신에서의 센서 네트워크 데이터, 뇌 영상에서의 기능 네트워크 데이터, 3차원 메쉬 데이터 (3D geometric data) 등의 비정형 데이터를 딥 러닝에 적용하는데 관심이 있는 연구자들이 반드시 알아야 할 기본지식을 제공한다.
석흥일 교수 (고려대학교)	Generative Adversarial Learning and Its Application to Medical Image Synthesis For the last years, deep learning has greatly impacted to the field of image processing and understanding, thanks to its architectural nature of discovering feature representations computationally. It is of no exception for medical imaging contributing greatly by demonstrating potential to complement image interpretation, augment image representation, and identification. In this talk, I will first introduce one of the most interesting recent breakthroughs in the field of deep learning - generative adversarial learning – and present its potential application to medical image synthesis.
고병철 교수 (계명대학교)	Fast Depth Map Estimation Using Light-weight CNN In a Single Image and its applications 깊이 맵은 3차원 구조를 이해하는데 핵심적인 요소이다. 일반적으로 깊이맵은 스테레오 카메라와 같은 부가적인 장비를 이용해야만 정확한 깊이를 획득할 수 있지만 최근 신경망을 이용한 연구가 활발히 진행되며 합성곱 신경망을 이용한 생성 방법들이 소개되고 있다. 본 강의에서는 적외선이나 다시점 카메라를 활용하여 3차원 깊이 정보를 얻어내는 대신에 단일 영상과 최적화된 CNN을 이용하여 빠른 시간 내에 깊이 맵을 생성하는 알고리즘을 소개한다. 더욱 정확한 깊이 맵 예측을 위해 픽셀 기반의 순서 회귀 (Ordinal regression)로 대략적인 깊이의 범위를 결정하는 방법과, 조밀한 특징 맵 (Dense feature map) 추출을 위해 다양한 컨벌루션 기법을 적용하는 방법을 소개한다. 학습과정에서 뉴런의 소멸을 막기 위해 학습 가능한 활성화 함수를 채용해 학습에 보다 유연한 모델을 설계하는 방법과, 경량 CNN알고리즘을 통해 단일영상으로부터 생성한 3차원 깊이 맵이 실측정보(ground truth)와 비교할 때 오차가 작으면서도 빠른 시간 내에 깊이 맵이 생성됨을 보여주고 이를 활용한 블록 3D 모델 자동생성, 보행자 추적 성능 향상 등의 응용 연구들을 소개한다.
변혜란 교수 (연세대학교)	Deep Learning Trend of CVPR 2019 in Image Understanding 본 강의에서는 지난 6월20일에 종료된 CVPR2019에서 발표된 ‘장면분석 및 이해’와 관련된 주요 논문을 리뷰 하도록 한다. 본 강연에서는 주로 ‘semantic image segmentation’과 관련된 논문 소개에 초점을 맞추도록 하며 의미 있는 영상 분석과 세그멘테이션을 위해 딥 러닝 기술들이 어떻게 다르게 발전되고 있는지를 비교 분석하여 소개하도록 한다.
한보형 교수 (서울대학교)	Neural Architecture Compression and Search 딥 뉴럴 네트워크는 대부분 많은 메모리와 계산량을 필요로 하기 때문에 모바일 디바이스나 임베디드 시스템 등에 사용되기에 여러 가지 어려움이 있다. 이러한 문제를 해결하기 위하여 모델을 압축하는 기술들이 소개되어 상대적으로 매우 적은 리소스만으로도 높은 성능을 달성하는 딥 뉴럴 네트워크가 개발되었다. 더 나아가서는 사람이 모델의 구조를 수동으로 디자인 하는 대신에 학습 알고리즘 내에서 모델의 구조와 파라메터를 동시에 학습하는 Neural Architecture Search (NAS) 기술들이 최근 각광받고 있다. 본 강의에서는 딥 뉴럴 네트워크의 모델을 압축하는 기술과 AutoML의 중요한 문제 중의 하나인 NAS 기술들에 대하여 소개하고 현재 기술의 발전 방향과 한계점에 대해서 논의하고자 한다.
이승용 교수 (POSTECH)	RGB-D 영상을 이용한 삼차원 복원 (3D Reconstruction Using RGB-D Images) RGB-D 카메라가 등장함에 따라 RGB-D 영상들을 이용한 다양한 삼차원 복원기법들이 개발되고 있다. 이 강의에서는 KinectFusion으로 대표되는 RGB-D 영상을 이용한 볼륨 기반 삼차원 복원의 기본 개념을 소개하고, 삼차원 복원의 품질을 향상시키는 기법들을 다룬다. 입력 RGB-D 영상의 노이즈를 제거하는 기법, 삼차원 복원의 결과로 단순한 메쉬가 아니라 물체 유형별로 분할된 메쉬를 생성하는 의미론적 복원(semantic reconstruction) 기법, 물체의 색상정보를 세밀하게 표현하는 텍스처 생성 기법을 소개한다. 강의 후반부에는 시간에 따라 모양이 변화하는 비강체 복원(non-rigid reconstruction)을 다룬다. 삼차원 비강체 복원을 위한 다양한 접근방법들과 최근 연구결과를 간략히 소개한다.
최종현 교수 (GIST)	뇌-Multi-task Learning on Estimating Depth and Optical Flow in Very High Frame Rate I’ll address a problem of estimating depth and optical flow in very high frame rate camera. For the high frame rate in a reasonable cost, an event camera is employed to capture the binary difference of each pixel. A deep neural network is learned to jointly estimate a RGB image, depth and optical flow simultaneously.