CVMT Colloquia 2009
Moderator:
Claus B. Madsen
Approximately every other week the
CVMT group
meets for
a technical colloquium, where people
from the
group take turns
to present own recent research,
relevant research
by other groups, or
rehearse an upcoming conference
presentation.
NEXT
EVENT
planned events past events
November 18,
2009: Claus B. Madsen
Title: From shadows to illumination
Time
and place: 1 pm in 3-228 (Las Vegas room)
The talk
presents recent work in the area of detecting dynamic shadows in image
sequences using a combination of color and depth information from the
Bumblebee XB3 commercial stereo camera. The goal is to develop a
technique that can run over very long sequences in outdoor scenarios
with constant illumination changes. Therefore the technique is based on
simple image differencing rather than relying on trained background
images as is the current state-of-the-art. Dynamic shadow detection is
demonstrated on a number of outdoor image sequences.
The talk also demonstrates how the
detected dynamic shadows can be used for 1) bootstrapping a technique
for detecting ALL shadows in the scene, and 2) estimating the
illumination from the sky and from the sun.
PLANNED EVENTS
NEXT EVENT
PAST EVENTS
December 9,
2009: John Doe
Title:
Time
and place: 1 pm in 3-228 (Las Vegas room)
PAST EVENTS
NEXT EVENT
PLANNED EVENTS
November 11,
2009: Thomas Moeslund
Title: The Hermes Project
Time
and place: 1 pm in 3-228 (Las Vegas room)
HERMES is an EU project
wherein CVMT has participated in the last three years. The project
concentrates on how to extract descriptions of peoples' behaviours from
videos in a restricted discourse domain, such as pedestrians crossing
inner-city roads, transform this into written text, and allow to
synthesis a dynamic scene based on a textual description. The recorded
videos allow to explore a coherent evaluation of human movements and
facial expressions across a wide variation of scale. This general
approach lends itself to various cognitive surveillance scenarios at
varying degrees of resolution: from wide-field-of-view multiple-agent
scenes, through to more specific inferences of emotional state that
could be elicited from high resolution imagery of faces.
This talk presents the overall ideas
behind HERMES together with concrete examples on what has been done in
the project.
October 28,
2009: Michael Nielsen
Title: Peach blossoms
Time
and place: 1 pm in 3-228 (Las Vegas room)
In spring I collected 32 GB of image data
in the cold and moist
Californian peach orchards. We had built transportable 3 camera stereo
rig competing against time given by when the peach blossoms would
bloom.
Because of the winter rain season drought and delayed rain the blooming
season was delayed 2-3 weeks, and the weather made it hard to actually
work in the fields with expensive electronics in the fields.
In this colloquium I will tell about all the joys of field work, new
challenges in 3d stereo vision with 10Mpixel images, and preliminary
results.
October 7,
2009: Giang Phuong Nguyen
Title:
Context-based adaptive filtering of interest points in image retrieval
Time
and place: 1 pm in 3-228 (Las Vegas room)
Interest points have been used as local features with success in many
computer vision applications such as image/video retrieval and object
recognition. However, a major issue when using this approach is a large
number of interest points detected from each image and created a dense
feature space. This influences the processing speed in any run-time
application. Selecting the most important features to reduce the size
of the feature space will solve this problem. Thereby this raises a
question of what makes a feature more important than the others? In
this paper, we present a new technique to choose a subset of features.
Our approach differs from others in a fact that selected feature is
based on the context of the given image. Our experimental results show
a significant reduction rate of features while preserving the retrieval
performance.
September 28,
2009: Marcus Pearce, University College, London
Title: From expectation to aesthetics in the
perception of music
Time
and place: 1 pm in 3-228 (Las Vegas room)

Aesthetic accounts of
music often tend towards one of two extremes:
formalism and referentialism. Yet empirical research indicates that
people value music largely because of the emotions that it evokes.
Leonard Meyer pointed out that emotional expression, usually associated
with referentialism, is actually compatible with both accounts and used
expectation to reconcile formalism with emotional expression in music.
Recent research in psychology and neuroscience has built on this
approach studying the cognitive, physiological and neural components of
expectation in music, covering both the factors that generate
expectations and the effects of expectations on the listener.
Researchers have identified three influences on a listener's
expectations at a particular point in a novel piece of music:
extra‐opus musical experience, intra‐opus structure and universal
rules. I will argue that structural expectations are largely acquired
and that rule‐like responses emerge as part of a process of implicit
statistical learning. Turning to the effects of expectation, Meyer
argued that confirmations and violations of expectation in music
generate experiences of tension and resolution that in turn produce
valenced emotional states. Again, there is empirical evidence that
violations of expectation do indeed generate physiological signatures
of emotion and activate reward circuits in the brain. Various
explanations of how this process operates have been proposed. I suggest
that it can be linked with the process of statistical learning and
prediction that generates expectations in the first place.
September 16,
2009: Carsten Høilund
Title: Free Space
Computation From Stochastic
Occupancy Grids Based On Iconic Kalman Filtered Disparity Maps
Time
and place: 1 pm in 3-228 (Las Vegas room)

The objective is
to determine the free space in a scene as viewed by a camera on a
vehicle. Free space is defined as the area where navigation without
collision is possible. The starting point for determining this free
space is disparity maps, obtained with a stereo camera. The disparity
maps are filtered by an iconic Kalman filter, operating on each pixel
individually. Applying ego-motion, the previous disparity maps is
predicted to correspond to the current disparity map. The two, ideally
identical, disparity maps are merged by the Kalman filter yielding an
optimal estimation of the true state, thereby reducing variance and
increasing the density of the filtered disparity map.
The stochastic occupancy grids are calculated from these disparity
maps, providing a top-down view of the scene where the uncertainty of
disparity measurements are taken into account. A pixel from the
disparity map can thus affect several cells with varying likelihood.
These occupancy grids are segmented to indicate a maximum depth free of
obstacles, enabling the marking of free space in the accompanying
intensity image. The test shows successful marking of free space in the
evaluated scenarios in addition to significant improvement in disparity
map quality.
July 9,
2009: Martin Kraus (one of the applicants for our positions)
Title: What to Do with More Graphics
Performance: Bigger, Better, Faster - or Something Different?
Time
and place: 1 pm in room 3-108 (or in Las Vegas)

Graphics performance
has been increasing dramatically during the last decade.
Computer-animated movies are commonplace today and many researchers are
working on real-time rendering with the same level of quality. This
vision could be characterized as "bigger, better, faster". At the same
time, the additional graphics performance also leverages very different
approaches to many problems in computer graphics. Moreover, these new
solutions also enable us to address new challenges.
This talk will sketch some of these new solutions and challenges that
I've been working on in recent years. The topics include interactive
illustrations, image interpolation, volume visualization and realistic
rendering. I will try to show how more graphics performance not only
changed our solutions but also the challenges that we are addressing.
May 6,
2009: Preben Fihl
Title: Computer Vision-Based Analysis of
Human Actions
Time
and place: 1 pm in room 3-108 (or in Las Vegas)

The talk
will present an overview of the work I have done over the past four
years within the area of analysis of human actions.
Video-based analysis of human motion has a wide range of interesting
applications – from very detailed motion capture for movie production
to coarse estimates of the flow of people in e.g. airports. Automatic
analysis of surveillance video is another application which has
received much interest in recent years, both from the political system,
the industry and from the research community, and this talk will
present work on different aspects of automatic analysis of surveillance
video.
The talk will first present a method for foreground segmentation in
relatively unconstrained outdoor scenes. Next I will present two
different approaches to human action recognition. The first approach
recognizes a set of one-arm gestures and the second approach classifies
human gait into three types, i.e. walking, jogging, and running. The
last part of the talk will present some ongoing work on full-body pose
estimation from multiple cameras.
April 22,
2009: Anne-Marie S. Hansen
Title: Data logging for social aspects
Time
and place: 1 pm in room 3-108 (or in Las Vegas)

Based on research within
tangible embedded interaction, systematic musicology and music therapy,
Anne-Marie's Ph.D. project explores new forms of playing, learning and
socializing in an arbitrary physical space. In recent years it has
become feasible to use wireless sensor technology in everyday life
situations. This Ph.D. project investigates how users understand and
interact with wireless interconnected sensor interfaces that provide
reactive and adaptive sound feedback. The interfaces can be music
instruments that adapt to the situation created by the users. It
documents how multi-modal one-to-many, many-to-many interaction
mappings cause interaction patterns that are measured qualitatively and
quantitatively.
Anne-Marie Skriver Hansen will present the current stage of her Ph.D.
research, which involves a continuous iterative loop that evolves
around the industrial design of a tangible toy unit, the design and
implementation of sound games, field studies of children's play and the
analysis of the interaction patterns that arise from each sound game.
April 15,
2009: David Meredith (CANCELLED)
Title: Machine learning of musical style (or
analysis and transcription of performed music)
Time
and place: 1 pm in room 3-108 (or in Las Vegas)
Abstract and picture(s) to be submitted by the speaker...
April 1,
2009: IKT på Toppen (ICT on the Top)
Title: visit from representatives from the
IKT på Toppen group
Time
and place: 1 pm in room 3-108 (or in Las Vegas)
IKT på Toppen (Information and Communication Technology "on the
Top", referring to the top of Denmark, aka. Aalborg) is an Aalborg
University construct, i.e., center with the University acting as a
focal point for all the nuiversity's research and education in
Information and Communication Technology. Simultaneously they act as an
interface for the industry to access the university expertise in these
fields.
This colloquium is organized to meet some representatives from IKT
på Toppen. They will come here to inform about their role and
give a status on activities so far and it will give us an opportunity
to comment on their work and to provide new ideas.
Visit their homepage
www.ikt.aau.dk
March 25, 2009: Claus B. Madsen
Title: On the analogies between computer
graphics and classical 1D signal processing
Time
and place: 1 pm in room 3-108 (or in Las Vegas)
There are several interesting analogies between modern computer
graphics and classical 1D signal processing. Computing/determining the
reflected radiance from a surface under some illumination conditions
corresponds quite precisely to computing/determining the time-domain
response of a filter to an input signal. The talk will "indicate" some
of these analogies without making the presentation inaccessible to
people with no signal processing background (hopefully). There will
also be examples of how the big rotating arc in the lab can be/has been
used to explore these analogies.
March 11, 2009: MED4 and MED6
Title: MED Awards spring 2009
Time and place:
12 o'clock (NOTE TIME) in B3-104 (the big auditorium next to the FRB 7
canteen)
All students presently on MED4 and MED6
will present their project AV productions from last (fall) semester.
MED2 students are invited as well. Each group will have 3 minutes to
present their project and then they will show their video. The event
will culminate in electing a MED3 and a MED5 video winner. We expect to
finish around 4 pm.
March 4, 2009: Cornelius
Poepel, Fachhochschule Ansbach, Germany
Title: Audio Signal Driven Sound Synthesis -
ASDSS
Time and place:
1 o'clock in N4-117
I will present parts of my own research into Audio Signal Driven Sound
Synthesis - ASDSS as well as the University and study course I come
from. Since we have
just set up the ERASMUS exchange program between the study programs
Medialogy / Aalborg and Muiltimedia and Communication / Ansbach I will
also talk about some classes/facilities as well as student works.
Research:
Field: HCI/Sound Synthesis
Topic: Audio Signal Driven Sound Synthesis
ERASMUS exchange Uni Aalborg / HS Ansbach
Study course Multimedia and Communication
Classes, Facilities, examples of students' works
Fachhochschule Ansbach