Home Subtitle videos The next computer? Your glasses

The next computer? Your glasses

Video thumbnail
Aktie:
00:04

For our entire lives, we’ve been living through a computing revolution.

00:08

Many of you here in this room have contributed to it,

00:11

with leaps forward in compute,

00:13

connectivity, mobile technologies,

00:16

and now AI.

00:18

For my part, I've dedicated my entire career to augmented reality,

00:23

fusing the real world with computing experiences.

00:26

I started this journey about 25 years ago for my PhD.

00:31

It might sound groundbreaking, but check out these early prototypes.

00:35

The technology was really primitive back then,

00:38

but the potential felt limitless

00:40

when we were experimenting in that university lab.

00:44

What I didn't know at that time

00:45

was many of the fundamental innovations for my work

00:49

would come from AI researchers in a different lab.

00:54

While I was helping computers see the world,

00:56

these AI researchers were helping computers reason about the world.

01:01

Since then, large language models and multimodal reasoning

01:05

have enabled richer language and image understanding.

01:10

These models are now fast enough for live conversations

01:14

where the AI can act on your behalf

01:16

and, most importantly, with your permission.

01:20

And augmented reality and virtual reality has moved computing

01:25

from the rectangular screen to the 360 immersive display

01:29

to now even the world itself becoming the display.

01:33

We now refer to this broad collection of experiences as extended reality or XR.

01:39

Until now,

01:41

these innovations have occurred separately and in silos.

01:46

Well here comes act two of the computing revolution.

01:50

AI and XR are converging,

01:52

unlocking radical new ways to interact with technology on your terms.

01:58

Computers will become more lightweight and personal.

02:02

They will share your vantage point,

02:04

understand your real-world context

02:06

and have a natural interface that's both simple and conversational.

02:12

Countless people and companies are innovating in this space,

02:17

including many on the TED stage this week.

02:20

We're excited to contribute to this momentum with Android XR.

02:25

It's an operating system we're building with Samsung

02:28

that brings XR hardware together with Gemini, our AI assistant,

02:33

to augment human intelligence.

02:36

It can support a broad range of devices,

02:39

from headsets to glasses

02:41

to form factors we haven't even dreamed of yet.

02:44

OK, let me show you where we’re heading

02:47

by inviting my colleague Nishtha to the stage.

02:51

Welcome, Nishtha.

02:52

(Applause)

02:54

Nishtha Bhatia: Hi.

02:56

Shahram Izadi: These ordinary-looking glasses are packed full of technology.

02:59

A miniaturized camera and microphones

03:02

give the AI the ability to see and hear the world.

03:05

Speakers let you listen to the AI and play music or even take calls.

03:10

And these glasses also have a tiny,

03:13

high-resolution in-lens display

03:16

that's full color

03:17

that I'm holding in my hand.

03:19

It's very, very small.

03:22

These glasses work with your phone streaming back and forth,

03:26

allowing the glasses to be very lightweight

03:28

and access all your phone apps.

03:30

And if you're wondering, I'm actually wearing the glasses too.

03:33

They're actually helping me see all of you in the audience

03:36

because they have prescription lenses inside them.

03:38

But they’re also displaying my speaker notes for me as well.

03:41

(Laughter)

03:43

For this demo,

03:44

you'll see what Nishtha is seeing on the screen behind her.

03:47

And this is the very first time we're showing these glasses in public.

03:51

So this is going to be a live demo of conceptual hardware, early software.

03:56

What could go wrong?

03:58

Nishtha, best of luck.

04:00

No pressure.

04:01

NB: Amazing.

04:03

Alright, let’s just make sure that these glasses are shown

04:06

on the screen behind us.

04:08

OK.

04:09

[Glasses screen off]

04:13

Awesome.

04:14

(Applause)

04:16

Now I'm going to launch Gemini.

04:19

Gemini: Hello there.

04:21

NB: Hi, Gemini.

04:22

Can you start us off with something fun and creative,

04:25

and write a haiku for what I'm seeing right now?

04:30

G: Sure.

04:31

Faces all aglow.

04:32

Eager minds await the words.

04:33

Sparks of thought ignite.

04:36

SI: Some anxious but happy faces as well, yeah.

04:40

As you can see, the AI sees what Nishtha sees,

04:42

hears what she hears

04:44

and is responding in real time.

04:46

But that’s just table stakes.

04:47

I think everyone in this audience has seen demos like this before.

04:51

Let's actually step it up a notch with something we call Memory.

04:54

For a rolling contextual window,

04:56

the AI remembers what you see

04:58

without having to be told what to keep track of.

05:02

NB: So you guys may have noticed

05:04

I snuck a peek back at the shelf a moment ago.

05:07

I wasn't paying attention, but let's see if Gemini was.

05:12

Hey, did you happen to catch the title of the white book

05:17

that was on the shelf behind me?

05:20

G: The white book is “Atomic Habits” by James Clear.

05:24

NB: That is absolutely right.

05:26

So let's try something harder.

05:28

I keep losing my hotel key card.

05:31

Do you know where I last left the card?

05:35

G: The hotel key card is to the right of the music record.

05:39

NB: Great.

05:41

SI: For someone as forgetful as me, that's a killer app.

05:45

OK, this AI is inherently multimodal.

05:48

So it's using visual data and natural language

05:50

to digest complex queries in real time

05:53

while remembering your past context.

05:56

Nishtha, let's see some more.

05:57

NB: Yeah, let’s bring this book back.

06:00

So I don't really have time to read it right now.

06:03

But I wonder if Gemini might be able to help me summarize what's going on.

06:08

Like, I don't know.

06:10

What does this diagram mean?

06:12

It looks complex.

06:14

G: This diagram is labeled “The Habit Line”

06:17

and illustrates the process of habit formation.

06:20

It shows how automaticity increases with repetitions

06:23

until a habit is formed.

06:25

NB: OK.

06:26

SI: Now imagine being able to communicate and understand any language.

06:30

The AI can shift its output to speak different languages

06:33

with dialects and accents that feel human and natural.

06:37

NB: Yeah, let’s go to this sign over here.

06:41

Can you translate this to English for me?

06:45

G: The sign states: “Private property, no trespassing.”

06:49

SI: OK, let's prove this is a live demo.

06:51

Does anyone in the audience have a suggestion

06:53

for a different language to translate to?

06:55

Audience: Farsi.

06:56

SI: Farsi.

06:58

We tried Farsi, it failed last time, but let's try it again.

07:01

NB: Do you want to try Farsi?

07:03

SI: Because I do speak Farsi.

07:04

It's my mother tongue, so thank you.

07:07

NB: Gemini, let’s just give this a shot.

07:09

Can you translate this sign to Farsi for us?

07:14

G: (Speaking Farsi)

07:17

SI: Great, awesome.

07:19

It speaks Farsi.

07:21

NB: That’s amazing.

07:22

So, as Shahram mentioned,

07:24

you all may have seen translation demos like this before,

07:28

but what's new now is that in addition to just saying things

07:32

in a different language,

07:33

I can also speak to Gemini in another language.

07:36

I know Hindi, so let's give this a shot.

07:41

(Speaks Hindi)

07:49

G: (Speaks Hindi)

07:52

NB: So Gemini said you all look focused and excited,

07:55

and it has a better accent than I do.

07:57

(Laughter)

07:59

SI: Alright, now let's see how the AI can connect the physical world

08:03

with your digital content and take action.

08:06

NB: Yeah, let’s get some music going in here.

08:09

OK, Gemini, why don't you play us a track from this record?

08:16

GB: Sure, here’s “Bad Dreams” by Teddy Swims.

08:20

(Music)

08:23

SI: Perfect.

08:24

In a few seconds, the AI recognized the album,

08:27

looked up the tracklist

08:28

and took action using the phone apps.

08:30

OK, Nishtha, it does look like the demo gods are with us.

08:33

Maybe with you more than me.

08:35

Let's do one last demo

08:37

I know you're keen to give.

08:38

NB: Yes, this is my first time in Vancouver,

08:41

and I love going on walks.

08:43

So why don't you navigate me to a park nearby with views of the ocean?

08:51

G: OK, I am starting navigation to Lighthouse Park,

08:54

which has magnificent views of the Pacific Ocean.

08:58

Is there anything else I can assist you with?

09:00

NB: Honestly, with these directions and a 3D map,

09:04

I should be all set, and hopefully I won’t look like a tourist.

09:07

Thank you all.

09:08

(Applause)

09:15

SI: Thank you, Nishtha, that was awesome.

09:17

OK, we've seen glasses.

09:19

Now let's turn our attention to the other side of the spectrum: headsets.

09:23

You've seen these types of devices before,

09:26

but when we first brought AI to a headset,

09:28

it completely caught me by surprise.

09:31

For this demo, we're going to use the Project Moohan headset

09:33

that Samsung is launching later this year.

09:36

Compared to glasses, headsets give you an infinite display

09:40

for getting work done or immersing yourself in a movie,

09:43

or maybe a TED Talk at some point.

09:45

Let me bring up my colleague Max to show us even more.

09:48

Hey, Max.

09:49

Max Spear: Hello.

09:50

(Applause)

09:51

SI: And the same thing is going to go here.

09:54

You'll see exactly what Max is seeing behind on the screen.

09:58

Go for it, Max.

09:59

MS: Let’s do it.

10:01

You'll notice we start grounded in the real world.

10:03

And I'm going to control the entire system with my eyes, hands and voice.

10:08

But where things get really interesting

10:10

is when we invite Gemini in as this conversational companion

10:14

that can come with us anywhere.

10:17

G: Hello.

10:19

MS: Hey, Gemini, can you bring up my trip planner for me, please?

10:23

G: Of course. Opening up your trip planner.

10:25

MS: Cool, but I left these windows really disorganized.

10:28

Can you help with that?

10:31

G: Of course I can help with that.

10:34

SI: No clicks, no keyboards.

10:36

It's just a conversation.

10:38

And the AI is taking action.

10:41

OK, some more audience participation.

10:43

Someone shout out a name of a place you want to visit.

10:46

Audience: Melbourne.

10:48

Audience: Cape Town.

10:49

SI: Let's go to Cape Town.

10:50

Max: OK, sounds fun.

10:52

Can you please take me to Cape Town?

10:57

G: Certainly.

10:58

Let me help with organizing the windows.

11:00

SI (Laughs)

11:01

MS: Awesome.

11:02

And can you also take me to Cape Town?

11:04

(Laughter)

11:06

G: I can certainly do that.

11:07

Let me take you to Cape Town.

11:09

MS: And we’re very organized as we go there, perfect.

11:12

SI: As you can see, the AI is taking Max's requests,

11:15

figuring out how best to answer it,

11:17

opening up the Maps app.

11:19

And from there, he can actually explore anywhere in the world in this 3D view.

11:24

MS: OK, this is pretty incredible.

11:26

Viewing the world from this angle, I can even zoom in to city levels.

11:29

But what's really interesting is having an AI here

11:32

who can see what I see.

11:33

Can you tell me more about the significance of this place?

11:39

G: I can indeed provide information about Table Mountain.

11:43

Table Mountain holds a profound significance,

11:45

deeply rooted in both its natural splendor

11:47

and its rich cultural history.

11:49

It has long been a spiritual and cultural symbol

11:52

for the Indigenous Khoekhoe and San people.

11:54

MS: OK, very good, Gemini.

11:56

I'll come back to this later.

11:57

It also does look like a table, that's handy.

12:00

SI: OK, Max, let's bring this closer to home.

12:03

How about exploring winter sports, seeing that we're in Vancouver.

12:06

MS: OK, let’s get some inspiration for Shahram.

12:08

Can you please show me 360 videos of this?

12:11

[Powder snowboarding]

12:14

G: Sure, here are some 360-degree powder snowboarding videos.

12:18

MS: OK, I’m kind of new to snowboarding,

12:21

so hopefully this video will help me learn.

12:23

Although this looks already way more intense

12:25

than I think I'm ready for.

12:28

What was the name of that trick?

12:32

G: That’s a backside 540 with a grab.

12:36

SI: The AI knows exactly what Max is looking at,

12:38

even in rich video content,

12:40

and can answer any question in real-time.

12:43

Max, are you sure this is Whistler?

12:46

MS: You know, Shahram, I actually have no idea.

12:48

I've never been here before, but let's find out.

12:52

What run do you think this is?

12:57

G: It’s probably one of the ungroomed runs in summit bowl

13:00

or the top section of West Ridge on Mount Bachelor,

13:02

as they are close to the summit and face north

13:05

towards Three Sisters and Broken Top,

13:06

the prominent mountains visible in the background.

13:09

MS: OK, Mount Bachelor, Oregon, got it.

13:11

SI: OK, let's have a little bit of fun now.

13:14

Let's have AI narrate this video.

13:16

Somebody in the audience, shout out a fun narration style

13:19

like, nature documentary or something like that.

13:22

Audience: Morgan Freeman.

13:24

SI: We can't do actors, but we'll get to that at some point.

13:28

Any other suggestions?

13:30

Audience: Movie trailer.

13:31

Audience: Horror movie.

13:33

SI: Horror movie.

13:34

That's going to be -- it might be gory, but let's go for it.

13:37

MS: Slightly dark, but let’s do it.

13:40

Can you please describe what you're seeing

13:42

as if you're an overly enthusiastic character in a horror movie?

13:49

G: Oh, darling, this is simply marvelous.

13:51

A desolate mountain scape.

13:52

How delightfully ominous.

13:54

Prepare for a chilling dance with the unknown.

13:57

Where every gust of wind whispers tales of icy terror and lurking shadows.

14:01

MS: OK, Gemini, that’s great.

14:03

Return to more positive things another day.

14:06

SI: OK, Max, I know there's one last demo,

14:08

and you've been apparently practicing your farming skills?

14:12

MS: Yes, Shahram, I’ve been excited to get back to my rural roots

14:15

and also embrace one of my favorite things,

14:18

which is playing games.

14:19

But it becomes a whole new, next-level experience

14:22

when you have an AI by your side,

14:23

both to help keep you company as you play,

14:25

but also to help you learn how to play these kind of games in the first place.

14:29

So, for example, what do you think I should do next here?

14:36

G: It looks like this is early in the game in “Stardew Valley.”

14:40

You need to earn money, so try planting some seeds.

14:42

First, use your hoe to till the soil, and then plant your seeds.

14:45

It looks like you have some parsnips.

14:47

Since it isn't raining,

14:48

you'll also need to water them with your watering can.

14:52

MS: Seems like a lot, and maybe robots will do this for us one day,

14:55

but for now, I'm going to have to return back to my farm later.

14:58

For one last thing, can you please give a playful two-sentence summary

15:01

of all the ground we covered today?

15:07

G: We soared from organizing a digital realm

15:09

to traversing snowy mountains,

15:11

and even found time to farm some virtual parsnips.

15:14

Quite the adventure today.

15:16

MS: Quite the adventure, indeed.

15:17

Thanks, Gemini.

15:18

(Applause)

15:24

SI: Fantastic, thank you Max.

15:26

We're entering an exciting new phase of the computing revolution.

15:30

Headsets and glasses are just the beginning.

15:33

All this points to a single vision of the future,

15:36

a world where helpful AI will converge with lightweight XR.

15:41

XR devices will become increasingly more wearable,

15:44

giving us instant access to information.

15:47

While AI is going to become more contextually aware, more conversational,

15:51

more personalized,

15:53

working with us on our terms and in our language.

15:57

We're no longer augmenting our reality,

15:59

but rather augmenting our intelligence.

16:02

Thank you so much.

AITransDub

AI-betriebene Videoübersetzung und Synchronisation

Sprachbarrieren sofort brechen! AI-betriebene Präzisionsübersetzung und blitzschnelle Synchronisation für Ihre Videos.