On rendering synthetic images for training an object detector

Computer Vision and Image Understanding - Tập 137 - Trang 24-37 - 2015
Artem Rozantsev1, Vincent Lepetit2,1, Pascal Fua1
1École Polytechnique Fédérale de Lausanne, Computer Vision Laboratory, Lausanne, Switzerland
2Graz University of Technology, Institute for Computer Graphics and Vision, Graz, Austria

Tóm tắt

Từ khóa


Tài liệu tham khảo

Q. Le, M. Ranzato, R. Monga, M. Devin, K. Chen, G. Corrado, J. Dean, A. Ng, Building high-level features using large scale unsupervised learning, in: International Conference on Machine Learning, 2012.

Shotton, 2012, Efficient human pose estimation from single depth images, IEEE Trans. Pattern Anal. Mach. Intell., 99

C. Burges, B. Schölkopf, Improving the accuracy and speed of support vector machines, in: Advances in Neural Information Processing Systems, 1997, pp. 375–381.

Decoste, 2002, Training invariant support vector machines, Mach. Learn., 46, 161, 10.1023/A:1012454411458

Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition, in: Intelligent Signal Processing, 2001, pp. 306–351.

T. Varga, H. Bunke, Generation of synthetic training data for an HMM-based handwriting recognition system, in: 7th Int. Conference on Document Analysis and Recognition, 2003, pp. 618–622.

Fleuret, 2001, Coarse-to-fine visual selection, Int. J. Comput. Vision, 41, 85, 10.1023/A:1011113216584

V. Lepetit, P. Lagger, P. Fua, Randomized trees for real-time keypoint recognition, in: Conference on Computer Vision and Pattern Recognition, 2005.

J. Marin, D. Vázquez, D. Geronimo, A.M. Lopez, Learning appearance in virtual scenarios for pedestrian detection, in: Conference on Computer Vision and Pattern Recognition, 2010, pp. 137–144.

L. Pishchulin, A. Jain, A. Mykhaylo, T. Thormaehlen, B. Schiele, Articulated people detection and pose estimation: reshaping the future, in: Conference on Computer Vision and Pattern Recognition, 2012.

K. Rematas, T. Ritschel, M. Fritz, T. Tuytelaars, Image-based synthesis and re-synthesis of viewpoints guided by 3D models, in: Conference on Computer Vision and Pattern Recognition, 2014.

P. Felzenszwalb, D. Mcallester, D. Ramanan, A discriminatively trained, multiscale, deformable part model, in: Conference on Computer Vision and Pattern Recognition, 2008, pp. 1–8.

Y. Freund, R. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, in: European Conference on Computational Learning Theory, 1995, pp. 23–37.

Serre, 2007, Robust object recognition with cortex-like mechanisms, IEEE Trans. Pattern Anal. Mach. Intell., 29, 411, 10.1109/TPAMI.2007.56

Lepetit, 2006, Keypoint recognition using randomized trees, IEEE Trans. Pattern Anal. Mach. Intell., 28, 1465, 10.1109/TPAMI.2006.188

D. Cireşan, A. Giusti, L. Gambardella, J. Schmidhuber, Deep Neural Networks Segment Neuronal Membranes in Electron Microscopy Images, in: Advances in Neural Information Processing Systems, 2012.

A. Handa, R. Newcombe, A. Angeli, A. Davison, Real-time camera tracking: when is high frame-rate best?, in: European Conference on Computer Vision, 2012, pp. 222–235.

Horn, 1981, Determining optical flow, Artif. Intell., 17, 185, 10.1016/0004-3702(81)90024-2

Barron, 1994, Performance of optical flow techniques, Int. J. Comput. Vision, 12, 43, 10.1007/BF01420984

M. Stark, M. Goesele, B. Schiele, Back to the future: learning shape models from 3D CAD data, in: British Machine Vision Conference, 2010, pp. 1061– 10611.

J. Liebelt, C. Schmid, Multi-view object class detection with a 3D geometric model, in: Conference on Computer Vision and Pattern Recognition, 2010.

Baker, 2011, A database and evaluation methodology for optical flow, Int. J. Comput. Vision, 92, 1, 10.1007/s11263-010-0390-2

B. Kaneva, A. Torralba, W. Freeman, Evaluation of image features using a photorealistic virtual world, in: International Conference on Computer Vision, 2011.

V. Athitsos, S. Sclaroff, Estimating 3D hand pose from a cluttered image, in: Conference on Computer Vision and Pattern Recognition, 2003, pp. 432–439.

L. Taycher, G. Shakhnarovich, D. Demirdjian, T. Darrell, Conditional random people: tracking humans with CRFs and grid filters, in: Conference on Computer Vision and Pattern Recognition, 2006.

Klein, 2010, Simulating low-cost cameras for augmented reality compositing, IEEE Trans. Visual Comput. Graphics, 16, 369, 10.1109/TVCG.2009.210

Kirkpatrick, 1983, Optimization by simulated annealing, Science, 220, 671, 10.1126/science.220.4598.671

N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, in: Conference on Computer Vision and Pattern Recognition, 2005.

K. Levi, Y. Weiss, Learning object detection from a small number of examples: the importance of good features, in: Conference on Computer Vision and Pattern Recognition, 2004.

T. Ruzic, A. Pizurica, Texture and color descriptors as a tool for context-aware patch-based image inpainting, in: SPIE Electronic Imaging, 2012.