You are on page 1of 2

GIAN LUCA MARIOTTINI: Good morning.

We are talking today about augmented reality for enhancing your perception.
My name is Gian Luca Mariottini.
I'm an assistant professor of computer science at the University of Texas
at Arlington.
Cameras are everywhere.
Cameras are in cell phones, are used for inspection of products, in an assembly
line for automation.
Cameras are also used for medicines.
For example, for screening and diagnosis of colorectal cancer.
There are many benefits of cameras.
Cameras are low cost.
They can sense the environment without touching it.
And they provide a lot of data about the surrounding environment like color,
texture, and shape.
Augmented reality uses cameras in order to enhance our sight and our perception
about the surroundings.
Very beautiful and cool application that I would like to show you
is about navigation in a city.
Here, as you see in this picture, augmented reality
is used to overlay on top of the scene observed
by the camera useful information about restaurants, hotels,
and other areas of potential interest.
Another application is for interior design.
Cameras, for example on an iPad, can be moved around so
that the user can see where the eventual furniture will
appear in the empty apartment.
Cameras are also used for video games.
So in this case augmented reality in particular
it's used to overlay virtual characters on top of the video
while the person is playing with the video game.
Augmented reality can be used for interactive applications.
Again, for example, for art, design, or engineering.
It's very important to understand the connection between augmented reality
and mathematics.
I think it's a beautiful connection, and we are going to explore it together.
In particular, there is a very interesting mathematical model
that I would like ti present it to you that is key in augmented reality.
Let's look at it together.
Cameras can be simply model with a mathematical pinhole camera model.
This model, as you see in the image to the right,
assumes that a specific object-- in this case a tree--
projects itself onto the camera, which is the one to the right,
through a little tiny spot which we call it "pinhole."
This model using a pinhole camera model.
This model assumes that all the light reflected from the object passes
into this little point, which is called "pinhole,"
and is then reflected inside the camera, but is
inverted, as you see the image of the tree in this case is inverted.
It's very important to model this physical property
with a mathematical model, so let's look at it together.
In the image to the right you can see a smaller representation
of the top image.
So you see the tree in this case from the point p,
for example, is projected into the camera
at the point p prime to the bottom right through the center of the camera,
the pinhole, which we call it o.
So in this case, by mathematics so we can simple use triangle similarity
and see that the triangle p, o, and q-- the one that
goes from the top of the tree to the pinhole
and back to the bottom of the tree-- is similar to the triangle
p prime or q prime.
Which is the panel that goes from the center of the outcome of the orange q
prime and p prime in the image.
So these two triangles are similar.
With does it mean they are similar?
That we can write, as you see in the equation at the bottom left,
that the distance pq divided by the distance qo
is equal to the ratio between the q prime p prime divided by q prim o.
This means, effectively, if I just change the labels of these distances
that the height of the tree divided by the distance of the tree
from the camera, is equal to the projection
of the tree in the image, which we call it v, divided by the focal length.
Again, it's a simple triangle similarity.
And the important thing is that now we can extract what v is.
V, again, is the projection of that specific point p in the image.
And so as you see now in the red box, we've
created a mathematical model that tells us
that the projection of an image of the point p
is equal to the focal length times the height of the object divided
by the distance of the object from the camera.
This is actually a very intuitive model, an
and I would like to show you why in the next slide.
Let's see why this model is realistic.
Let's suppose that we are facing a railroad.
In this case, all these parallel lines of the railroad
they really go to infinity, because they're getting
further and further away from us.
And the point of intersection of these two lines
is called "vanishing point," as you see in the representation
here in the image to the right.
Can we relate this to the pinhole model that we just saw before?
Yes, indeed.
What it means that two lines are going to infinity?
That the distance of a specific point on the line is going to infinity.
So as d goes to infinity, as you see in the model to the left,
then v goes to zero.
That means that the projection of these points
actually happens to be in the center of the image, as we would expect,
as you see from the picture of the railroad.
So with this I hope that I really motivated you
more about the pinhole camera model and its application to augmented reality.

You might also like