Archive Talks

Benedetta Gennaro

Technische Universität Darmstadt, Institut für Soziologie

March 11, 2015

Title: Of Breasts and Symbols: A Visual Journey through Twenty-Five Centuries of Western Art and Culture

http://www.ifs.tu-darmstadt.de/index.php?id=3619

Michael Tarr

Carnegie Mellon University

February 26, 2015

Title: "Real stupidity beats artificial intelligence every time" (Terry Pratchett)

Abstract:

How is it that biological systems can be so imprecise, so ad hoc, and so inefficient, yet accomplish (seemingly) simple tasks that still elude state-of-the-art artificial systems? In this context, I will introduce some of the themes central to CMU's new BrainHub Initiative by discussing: (1) The complexity and challenges of studying the mind and brain; (2) How the study of the mind and brain may benefit from considering contemporary artificial systems; (3) Why studying the mind and brain might be interesting (and possibly useful) to computer scientists.

Speaker Biography:

Michael J. Tarr is the Head of the Department of Psychology in Carnegie Mellon University's Dietrich College of Humanities and Social Sciences and the Chair of Carnegie Mellon's BrainHub Steering Committee. He studies the neural, cognitive and computational mechanisms underlying visual perception and cognition. He is particularly interested in object and face recognition, how we become visual experts for non-face object domains, and how visual perception interacts with our other senses, with cognition, and with social and affective processing. Much of his work is predicated on the idea that models of artificial and biological vision have something (meaningful) in common and that both disciplines will benefit from greater interaction. From 2009 to 2013, he was the co-director of the Center for the Neural Basis of Cognition (CNBC) at Carnegie Mellon. Before joining the CMU faculty in 2009, he spent 14 years on the faculty of Brown University and 6 years on the faculty of Yale University. He received his PhD from M.I.T. in 1989 and his BA from Cornell University in 1984. In 2003, the National Academy of Sciences recognized Tarr with the Troland Award, given annually to honor unusual achievement and further empirical research in psychology. The American Psychological Association recognized him with the APA Early Career Award in 1997. He is a fellow of the American Psychological Association and the Society of Experimental Psychologists.

http://tarrlabwiki.cnbc.cmu.edu/index.php/Main_Page

Paul G. Kry

School of Computer Science, McGill University, Canada

February 24, 2015

Title: Balancing Speed and Fidelity in Physics Based Animation and Control

http://www.cs.mcgill.ca/~kry/

Nikolaus F. Troje

BioMotion Lab, Queen's University, Canada

February 18, 2015

Title: What is biological motion?

http://www.biomotionlab.ca/niko.php

Vladlen Koltun

Intel Labs, Santa Clara, CA, USA

February 17, 2015

Title: Reconstructing Complete 3D Models from Single Images

http://vladlen.info

Michael Goesele

TU Darmstadt

February 16, 2015

Title: Reflecting in and on the Gradient Domain

http://www.gris.informatik.tu-darmstadt.de/~mgoesele/

Wenzel Jakob

ETH Zürich

October 28, 2014

Title: Capturing and simulating the interaction of light with the world around us

Abstract:

Driven by the increasing demand for photorealistic computer-generated images, graphics is currently undergoing a substantial transformation to physics-based approaches that accurately reproduce the interaction of light and matter. Progress on both sides of this transformation -- physical models and simulation techniques -- has been steady but mostly independent of one another. When combined, the resulting methods are in many cases impracticably slow and require unrealistic workarounds to process even simple everyday scenes. My research lies at the interface of these two research fields; my goal is to break down the barriers between simulation techniques and the underlying physical models, and to use the resulting insights to develop realistic methods that remain efficient over a wide range of inputs.

I will cover three areas of recent work. The first involves volumetric modeling approaches to create realistic images of woven and knitted cloth. Next, I will discuss reflectance models for glitter/sparkle effects and arbitrarily layered materials that are specially designed to allow for efficient simulations. In the last part of the talk, I will give an overview of Manifold Exploration, a Markov Chain Monte Carlo technique that is able to reason about the geometric structure of light paths in high-dimensional configuration spaces defined by the underlying physical models, and which uses this information to compute images more efficiently.

SHORT BIO: Wenzel Jakob is a Marie Curie Postdoctoral Fellow at ETH Zürich in the Institute for Visual Computing. He obtained his Ph.D. in 2013 under the supervision of Dr. Steve Marschner at Cornell University and conducted his undergraduate studies at the Karlsruhe Institute of Technology. Wenzel's experience includes research and development work at Disney Research Zurich and Weta Digital, and he is the lead developer of Mitsuba, a research-oriented open-source rendering system that has become a popular research platform in rendering and appearance modeling.

http://www.mitsuba-renderer.org/~wenzel/

Konrad Schindler

ETH Zürich

October 15, 2014

Title: Images everywhere - computer vision with vehicle-mounted, airborne and tourist cameras

http://www.igp.ethz.ch/photogrammetry/people/Schindler

Leonid Sigal

Disney Research Pittsburgh

September 15, 2014

Title: Weak-supervision for Object Detection and Image/Video Set Summarization

Abstract:

The growing scale of image and video datasets in vision makes labeling and annotation of such datasets, for training of recognition models, difficult and time-consuming. Further, richer models often require richer labelings of the data, which are typically even more difficult to obtain. In this talk I will focus on two models that make use of different forms of supervision for two different vision tasks.

In the first part of this talk I will focus on object detection. The appearance of an object changes profoundly with pose, camera view and interactions of the object with other objects in the scene. This makes it challenging to learn detectors based on object-level labels (e.g., “car”). We postulate that having a richer set of labelings (at different levels of granularity) for an object can significantly alleviate these problems. Such labelings include finer-grained sub-categories, consistent in appearance and view, and higher-order composites – contextual groupings of objects consistent in their spatial layout and appearance. However, obtaining such a rich set of annotations, including annotation of an exponentially growing set of object groupings, is infeasible. To this end, we propose a weakly-supervised framework for object detection where we discover the subcategories and composites automatically with only traditional object-level category labels as input.

In the second part of the talk I will focus on a framework for large-scale image set and video summarization. Starting from the intuition that the characteristics of the two media types are different but complementary, we develop a fast and easily-parallelizable approach for creating not only video summaries but also novel structural summaries of events in the form of storyline graphs. The storyline graphs can illustrate various events or activities associated with the topic in the form of a branching directed network. The video summarization is achieved by diversity ranking on the similarity graphs between images and video frames, thereby treating consumer images as essentially a form of weak supervision. The reconstruction of storyline graphs, on the other hand, is formulated as inference of sparse time-varying directed graphs from a set of photo streams with the assistance of consumer videos.

Time permitting, I will also talk about a few other recent project highlights.

http://cs.brown.edu/~ls/

Jonathan Taylor

Microsoft Research Cambridge

September 4, 2014

Title: Hands and Dolphins: Modelling Non-Rigid Shape with Subdivision Surfaces

Abstract:

I will present a general framework for modelling and recovering 3D shape and pose using subdivision surfaces. To demonstrate this framework's generality, I will show how to recover both a personalized rigged hand model from a sequence of depth images and a blend shape model of dolphin pose from a collection of 2D dolphin images. The core requirement is the formulation of a generative model in which the control vertices of a smooth subdivision surface are parameterized (e.g. with joint angles or blend weights) by a differentiable deformation function. The energy function that falls out of measuring the deviation between the surface and the observed data is also differentiable and can be minimized through standard, albeit tricky, gradient-based non-linear optimization from a reasonable initial guess. The latter can often be obtained using machine learning methods when manual intervention is undesirable. Satisfyingly, the "tricks" involved in the former are elegant and widen the applicability of these methods.

http://research.microsoft.com/en-us/people/jota/