1.Intro - (1)

CS/컴퓨터비전

1.Intro - (1)

sliver__ 2022. 3. 9. 23:04

728x90

Computer vision : Image(2D) - > Geometry(3D) + Photometry(빛)
Computer Graphics : Geometry(3D) + Photometry -> Image

Image ?
- 3D 장면을 빛을 사용하여 2D로 projection 한 것
- $ f(x,y) : R^{2} \to R $
- (x,y) : position of image plane
- f(x,y) : the light intensity at this point

Acquisition System
- World -> Camera -> Digitizer (CCD : DSLR, CMOS : 휴대폰) -> Digital Image

Sampling and Quantization
- $ y = \int_{0}^{\tau} x(t) dt $ => 타우시간 동안 shutter를 열었을 때, 신호를 합한 정보.
- 카메라 shutter를 여는 시간으로 신호의 양을 조절
  - 낮에는 빛이 많기 때문에 적은 시간만 열어둠
  - 밤에는 빛이 적기 때문에 긴 시간 열어둠

Computer Vision?
- 컴퓨터가 사람처럼 이미지를 이해하도록 만드는 기술
- Make computeers understand images and videos or any visual data.

History
- 1960's : Interpretation of synthetic(인조) worlds
- 1970's : Some progress on interpreting selected images
- 1980's : ANN come and go
- 1990's : face recognition, statistical analysis
- 2000's : broader recognition
- 2010's : Resurgence(재기) of deep learning

Challenges
- Many nuisance(성가신 것) parameters
  - Illumination(역광)
  - Object pose(다양한 포즈)
  - Clutter (흩어져 있는 것)
  - Occlusion (가려짐)
  - Intra-class apperance (같은 class 지만 비슷한 형체)
  - Viewpoint (촬영 시점)
- Importance of context
  - 컴퓨터에게 전체 문맥을 이해시켜야 작은 문제를 풀 수 있다.
  - 어떠한 부분의 사진을 보고 전체 사진을 유추할 순 없지만, 전체 사진을 보고 작은 부분을 유추할 수 있다.

Computer vision tech
- Object dectection : 물체의 위치 파악
- Recogintion : 물체가 어떤 것인지 파악
- Scene Understanding(Semantic Segmentation) : 물체를 pixel 단위로 인식
- Visual Tracking : 특정한 target만 dectecting / 현재 : 여러가지 object를 동시에 tracking
- 3D Resolution : 여러시점에서 찍은 사진의 좌표를 저장하여 좌표에 색을 입힌다(texture mapping)
- Low Level Vision : 흐릿한 이미지의 형태를 뚜렷하게 표현하는 기술
- Optical Flow Estimation : 사물의 움직임을 벡터로 나타내어 물체의 움직임을 파악, motion을 찾는데 사용
- Self-driving car
- AR / VR

728x90