Tech Overflow

Is Alexa Really Listening?

Hannah Clayton-Langton and Hugh Williams Season 2 Episode 1

Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.

0:00 | 47:49

Send a text

Ever had an ad land so perfectly it felt like your phone must be listening? We open season two by pulling back the curtain on why targeting feels psychic without constant eavesdropping. Smart speakers like Alexa and Siri rely on wake words and short cloud trips to respond, but the real signals come from everyday behaviour: where we go, what we search, how we scroll, who we share with, and even the Wi‑Fi we share at home.

We walk through the mechanics in plain English. Location is a powerhouse signal, honed by teams obsessed with that blue dot. Search keywords and time of day amplify intent. On Instagram and Facebook, taps, pauses, replays and shares teach models what you truly like, while the friend graph links your interests to those closest to you. That’s why you’ll see cold-water swimming after your best friend dives in, or vinyl reissues if your circle is deep into music. No spy mic required—behaviour beats words.

We also talk product realities. Smart speakers started simple, users learned their limits, and even as features improved, trust lagged. Hence those nudges mid-task, a trade-off between discovery and annoyance. Meanwhile, data retention and privacy are shaped by regulators—led by the EU—while companies push for more data to serve the “long tail” of obscure but important questions. We share examples—from kitchen timers to niche medical insights—of how scale turns data into relevance.

Our bottom line is practical and honest. Free apps deliver real value—maps that never get lost, messaging that shrinks distance, feeds that surface what you care about. In return, we hand over behaviour and metadata that make ads sharper and recommendations feel uncanny. If you’re uneasy, start with permissions, location settings and usage habits; the most revealing data is not what you say out loud, but what you do. If you’re comfortable with the trade, enjoy the discovery, eyes open to how it works.

Enjoyed the conversation? Follow the show, leave a review, and share this episode with someone who swears their phone is listening. Your take: fair exchange or too much data—where do you land?

Like, Subscribe, and Follow the Tech Overflow Podcast by visiting this link: https://linktr.ee/Techoverflowpodcast

Hannah Clayton-Langton:

I've got smart friends who just won't believe it. You're giving these companies pretty much everything that uh that they could possibly know about you. Hello, Hugh.

Hugh Williams:

Hello, Hannah.

Hannah Clayton-Langton:

Welcome to season two of the Tech Overflow podcast. You might notice that I'm sitting in your house.

Hugh Williams:

You are in my house. You're in what used to be my study, and one day it will become a place where I can listen to music, hence all the vinyl records.

Hannah Clayton-Langton:

But for now, it's very much a I think saying it's like a homegrown podcast studio undersells it because you've definitely gone full tech guy with all of the equipment that's surrounding us.

Speaker:

Yeah, and I've learned a lot about how to record audio and how to record video and video capture cards and all kinds of things, lighting setups. Hopefully I've done an okay job for a complete amateur.

Speaker 1:

Someone did ask me for a podcast episode about how you record a podcast. So it's a little bit meta, but it sounds like you're now fully qualified to lead on that as well.

Speaker:

Yeah. Yeah, yeah. And then we can next season do one about how to record a podcast episode about how to record a podcast episode.

Speaker 1:

Okay, so it is February 2026. It is. How have you been since we closed our season one?

Speaker:

I've been great. I did the promised high rocks that we talked about in the season rap episode. Family came for Christmas, all those kinds of things. Obviously, summer here, plenty of time in the pool, down the beach, all that good stuff. Um, and then season two crept up on us very, very quickly. How about you? How was your host season one experience?

Speaker 1:

Uh well, it was November, December, and January UK. So I can't conjure up images of sitting by a pool. Um, I worked a lot on the the chief of staff and lead the strategy team for a UK tech company, and we were pretty busy through the end of the year and start of 26. So I basically did a lot of work, made the journey down a couple of days ago, ready to hit the ground running on season two.

Speaker:

Awesome. So season two, it's gonna be a little bit different to season one.

Speaker 1:

I would say it's a build.

Speaker:

A build.

Speaker 1:

Yeah.

Speaker:

Yeah, I like that.

Speaker 1:

Season one, we were figuring out how you actually record a podcast, both in terms of content and the technical side of it. I learned a lot about time zones in season one. And we pushed out a bunch of episodes and you guys listened and loved them, which was great. And Hugh has been breaking down the stats on them. So, what have we learned? What are the numbers telling us?

Speaker:

Numbers are telling us that we're building, and I thank everybody for following and subscribing to the show, engaging with us on socials, but we are we are building. What I also learned is that we do best when one of two things is true. Either we cover a topic that absolutely all of our listeners are interested in.

Speaker 1:

Like AI.

Speaker:

Like AI did super, super well. Driverless cars, our interview with Nick Pally, everybody was interested in that. So when we do that, things go super, super well for us. And then also, if we tell a great story, that really resonates with our listeners too.

Speaker 1:

Okay, so relevant content, great stories. Sounds fair.

Speaker:

Sounds fair. So more relevant content, more great stories. That's the focus for season two.

Speaker 1:

So I've been uh scrutinizing Hugh's little black book of contacts, and we've got some, no spoilers, but we do have some pretty epic guests lined up.

Speaker:

We do, we do. And I know our two episodes in season one where we had guests were two of our best rating episodes. No surprise. Uh, I thought Jonathan Bedeen and Nick Pelle were fabulous, fabulous guests. So, you know, more guests, I think uh I think they'll be super popular episodes. I can't wait to hear what uh all of our listeners think.

Speaker 1:

And when we're not with guests this season, Hugh, what are we going to be talking about?

Speaker:

Well, I think we're gonna lean a little bit more into AI. I mean, it's certainly uh it's certainly the topic de jour at the moment. So we'll definitely lean into a little bit more around AI, a little bit more around sort of tech as it impacts uh everybody, smartphones, those kinds of things. So a little bit more of that, home devices, things that I think all of our listeners are encountering every day. We'll try and demystify those for uh for our curious listeners out there.

Speaker 1:

And guys, I'm already loving Australia. I've been here about 48 hours, not including the flight down. So if we could just nail season two, I'm inviting myself back, Hugh.

Speaker:

Yeah, we need that podcast deal, Hannah. And then you your flights will be paid for. This will be a professional thing.

Speaker 1:

Won't be back of the plate.

Speaker:

Yeah, we need that podcast deal.

Speaker 1:

Yeah. Yeah, yeah. Let's make it happen. I think we've got a good set of listeners and a good set of topics. I think we can make it happen.

Speaker:

And if you a listener want to help us, I'd say the most important thing you could do would be review our podcast. So go to Apple Podcasts and write a review. Be super, super great. And of course, follow and subscribe to the show and engage with us on socials. We'd love all of that.

Speaker 1:

And please, please recommend us to your friends, family, colleagues. Um, that is a great way for us to grow, is word of mouth. If you like what you hear, please share the love. I'm sure your friends and family will be grateful. All right, so season two, episode one. Is Alexa really listening? And I'm sure other people have had this moment where they open their phone. For me, it's often on Instagram. And like Instagram knew I was coming to Melbourne, like the day we booked, maybe even before we booked the flight. And I mean, I personally find it quite useful because it fed me lots of tips and recommendations, but that's kind of crazy.

Speaker:

Yeah, it is. And uh, you know, if I think back to um my early days at Microsoft and, you know, the great people that I worked with, I'd say if I fast forward a little bit, some significant fraction went to hedge funds. Um, so some of these smart people off uh off making money, and I'd say some significant fraction went off to ads and are making sure that uh highly relevant ads end up in front of you and uh that these these feeds work really, really well. So some of the greatest minds of our time are definitely making sure that you get ads for Australia.

Speaker 1:

Well, controversial take, but I'm kind of grateful because the ads are they're feeding me stuff that I want. They recommend clothes mostly that suit exactly what I didn't even know I wanted. Sometimes I even like if I want something, I want a particular item or I want like a new bag or something, I'll search it a bit on Instagram just to get the algorithms going because I want it to feed me inspiration.

Speaker:

Yeah, and um, you're doing all the work you need to do to make sure that uh you get those ads for a while.

Speaker 1:

Well, yeah, that's the thing. And then sometimes it keeps going, and I'm like, I've already bought one. I don't want to, I don't want to know what I could have done better.

Speaker:

It's one of the hardest problems in ads, actually.

Speaker 1:

Really? Okay.

Speaker:

Yeah, absolutely. Is uh, you know, you're on Amazon, you've you've got a friend who's had a baby, you do some shopping for baby goods, you buy them the baby goods, you ship them the baby goods, and then uh, you know, you get baby goods forever. Which I don't know, maybe for a certain demographic is fine, but for me, I'm like enough of the baby goods. Uh I'm not in that phase of my life. Um, but it's a hard problem to figure out when somebody's what we call intent has has actually changed. And say it was, you know, ephemeral. Hard to figure it out.

Speaker 1:

Okay, so we've kind of got into the subject matter, but I mean there's this age-old myth that our phones or our Alexa or Google homes are listening to us, and for the record, that is technically possible. Like, I'm not saying that that's what's happening, but you have a device in your home with recording device on it, and it's got access to the internet. So, like someone could be hacking that and like tapping in. Not that I'm having particularly interesting conversations at home, but that is a technical possibility, right?

Speaker:

Totally, totally, totally is a possibility. And and look, you know, we'll we'll probably talk about it a little bit later on in the in the show. But you know, folks like Amazon do pay people and they sign an agreement so that they can record them, you know, 24 hours a day across their house to try and understand, you know, what they're doing, the things they say, helps them develop a better Amazon echo, those kinds of things. But by and large, um, what is supposed to happen is that these devices will only activate when you say the keyword. And I'm a little bit hesitant to say it because I'm not sure. Yeah, the wake word. We but yeah, we're pretty, probably pretty uh but they're supposed to, you know, you've got uh you're supposed to say hey Siri and your um Apple device starts listening to you and listens until it believes that you've finished whatever it is you're saying, and then obviously, you know.

Speaker 1:

I wonder how many times in this record will activate the week word. So they will literally pay people. How do I get this? I want to be paid for just being at home using Amazon Alexa.

Speaker:

Yeah, there's actually um there's a couple of companies out there. Um, there's one that's been around for a long time called Appen, A-P-P-E-N. It's actually an Australian company based in Sydney. And a lot of the large tech companies, you know, whether it's Meta, Amazon, Google, Facebook, Microsoft, a lot of those folks will actually hire Apple to go and source humans from around the world to carry out tasks that they want carried out. So if, for example, you're working on the Amazon Echo and you're working on the Alexa product, you might go to Apple and say, look, I want to recruit some people across the world to, you know, do some task. It's going to help me improve the product. And then the Apple folks will go and recruit those people, train those people, get them to perform the task, do all the legal agreements, pay them, all those kinds of things, and then eventually send you back the back the data. But they've got this huge flexible workforce across the world that can do any task you you want done.

Speaker 1:

This is, I keep learning about this whenever we talk about stuff, and you'll be like, yeah, there's a whole industry behind it that can link people up to do whether that was like human in the loop checking for the LLMs or this kind of thing. And I guess they'll have to do it in like most major languages, right? Because Alexa's the same, at least in every language that I've seen people use it, which admittedly is probably less than 10. Um, but I assume it's pretty universal, Alexa.

Speaker:

Yeah, that's it. And they actually have naming teams as well, and some of them hire an agency to make sure the name works across the world. So I actually had a friend who came up with the name Pentium when uh when Intel invented the Pentium chip. And one of her jobs was to make sure that you know, Pentium was a fine word uh for Spanish-speaking people, Italian-speaking people, Arabic-speaking people, what whatever it is, and make sure that it didn't offend anybody. And uh, you know, it sounds pretty high-tech and sounds pretty good in all languages. And uh, I'm sure that the folks at Amazon probably did that with Alexa and the folks at Apple did that with Siri.

Speaker 1:

Amazing. And it must be annoying though. Imagine if your name is Alexa, you know, you were named before the famed Alexa. That must be slightly irritating.

Speaker:

Yeah, yeah. One of my uh youngest daughters um friends' sisters is her name is Alexa, and she was quite annoyed by it, but you can actually go into the settings and change it to a different keyword. That didn't make it easy when she visited her friend's houses. But you know, back to your key question, which is a great question, is um they should not be, they're not designed to record at all times. They're only designed to record when you say the keyword, and then they're designed to continue until they believe that you finish whatever it is that you're going to say, and then they should turn back off again. Though there has been a couple of legal cases, um, Apple was taken to court, Amazon was taken to court where you know people said, hey, you know, it sometimes records when it shouldn't be recording because it it thinks it should be. So it thinks it's been triggered. Um, it's this gray area, right?

Speaker 1:

It's like I say I used to work with someone called Shiri, and then every time I said Shiri, Siri would think that I activated it or, you know, anything that my brother's called Alex, so I'm sure they accidentally set off his Alexa all the time. Hi, Al, if you're listening. Um, but yeah, I mean that's it's a funny one because it means that they're collecting, like in the case of my brother, it would mean that they could be collecting a bunch of stuff that was really not intended to be for the Alexa. But what can they do if if the words sound similar? I mean, I guess it's it's kind of fair enough. Is that controversial?

Speaker:

Well, as I say, you can go into the settings and adjust the keyword. So you could pick a different keyword and uh, you know, pick something that's less likely to be triggered. And, you know, there's people working hard to make sure that it's not triggered unless it should be. And I'd say the 99 point something percent case is it should only turn on when you say the keyword. So Alexa is listening only when you say the keyword, it's not listening on.

Speaker 1:

But they are, I guess is a key point that we've maybe not stated, they are sending those clips back to head office to check for like quality of answers, and and that's like a product development. Is that kind of like when an app asks if it can send the logs back to the exactly that, exactly that.

Speaker:

So obviously, if you think about what's going on, you're saying some words, those words are translated into a digital form. That digital content is going back to the head office or to a data center run by that company, and then that gets uh you know transcribed out as text, um, which you know we obviously we use in the podcast to transcribe out our recordings, and you might be looking at the transcription now, that's automatically done. So the same kind of tech is used in the data center, and then you know, try and recognize what the question is or what the statement is, figure out what the answer should be, synthesize that answer as voice, send that voice back to the device and play that back to you. So they they need an internet connection to work. I don't know if you've ever been in a flat spot and tried to get Siri to work and it says, Oh, sorry, I'm not available right now. So it needs an internet connection and you know, your voice becomes text, something happens, some text is generated, that text becomes a voice, that voice you've chosen, and you hear that back on you hear that back on your device. But certainly whatever it is you've said has ended up in a data center somewhere owned by one of these companies.

Speaker 1:

And there's no way of turning that off.

Speaker:

No, no, it just wouldn't work. Um, so you know, obviously a lot of folks um who've listened to the show have listened to our large language model episode and realized that a lot of things going on in the cloud that power these. And you know, if you squint enough, it's similar kinds of technology that needs a lot of computing power and models and those kinds of things. So this kind of technology is really only possible in the cloud. Of course, you could you could probably get it to give some simple answers, like tell you the time or whatever it is just on your smartphone. But by and large, these are internet-enabled um services that operate in the cloud.

Speaker 1:

I think that's the number one question at home we ask Alexa. It's always what the time is. It's like the laziest. We both wear watches, we're never far from our phones, and yet I think it's got to be the number one. Alexa, what time is it? Always.

Speaker:

Yeah, well, in prep for this episode, I looked back through my Alexa chats or the family's chats because I've got all of those, so I can find out what my daughters were asking and things, which again, you know, you can ask whether that's a that's a fair thing or not, but I've got the whole family's Alexa chats, and I'd say the 90% case is people setting timers in my house. Yeah, setting timers, that's another one. That's the 90% case, and then the other 10% is probably a little bit of telling the time and converting temperatures between, you know, Fahrenheit and Celsius, but some pretty basic stuff for some pretty advanced technology.

Speaker 1:

And I think you said in the prep for this episode that like that's one of the biggest challenges for the product folks that look after Alexa, that it's I remember when it landed, it was like it was a huge new product. It was all these sort of projections that it would become another member of the household. And it's not really developed much past me. Uh, we use it for shopping lists. You know, if I run out of pepper or whatever, we'll say, Oh, Alexa, add pepper to my shopping list. And that is helpful, but it's that, it's the time, it's very little more than that.

Speaker:

And look, you know, it's a difficult product to work on, I think, because you put yourself in the shoes of the product managers who work on Alexa, right? The problem you've got is that lots of people out there have asked Alexa to do something. It didn't work. And so what you've learned as a user is Alexa can't do that. Now there's some product manager who works for Alexa, they're doing their research, figuring out what features to build. They they figure out that that's a common feature that everybody wants. They go and build the feature, they ship the feature, and now Alexa can do that, but you don't know that because you tried it once and it didn't work. So why would you try it again? So it's very, very hard to educate the users that the product is actually evolving because people learn by trial and error. And once they have an error, then they never try it again. So you'll notice now that sometimes it'll promote things. So, you know, you'll say, hey, set a timer for 10 minutes and it'll say timer set for 10 minutes. By the way, did you know that I can play your playlist from Spotify? And you and it's incredibly annoying. But that's the that's the product managers trying to advertise the fact that they've built a new feature. But it, you know, it creates this sort of cognitive dissonance, right? Where it's suddenly sort of almost putting an ad into a task that you want it to do. So it's sort of unfortunately got a little bit marooned by that because it started off with a basic set of features, people assume that's all it can do, and so you know, it's a basically a fancy egg timer, even though it's capable of doing a lot more today.

Speaker 1:

Yeah, that happens and it's so sometimes I'm a little rude, I have to admit. Um that's I'm not proud to admit this, but when Alexa can't get something simple right, if they're recording my reaction, it's probably not very flattering. So we know that Alexa and Siri, etc., aren't listening. So what creates this phenomenon? Because I could say that, right, to some of my friends and family, and they just simply won't believe you based on the precision and relevance of the ads that they get. So, like, convince me that my phone isn't listening to me.

Speaker:

And look, I've got smart friends who just won't believe it. They say, yes, it is. It's the only way that it could be serving ads like that. But let's let's just zoom out a little bit and think about the device that you've got in your pocket, right? So we talked about that back in season one, you know, what smartphones are capable of and things, but but let's just let's just think about a couple of the basic things right now. So, first of all, you're carrying it around um and you're going to different locations, and that location data is being sent back to somewhere. So if you're a user of Google Maps, you've agreed to share your location with Google, and now Google knows where you are, right? So if, for example, um, you know, you go to a surf shop, you want to buy some board shorts, a surfboard, whatever it is, and you're walking around the surf shop, you've just given, you know, and you've navigated there, you used Google Maps, you've just given Google the signal that you're interested in surf shops.

Speaker 1:

Does it know here if I because I go to the gym several times a week, does it know I'm in there? Not or only if I'm yeah, yeah, it does pretty accurately.

Speaker:

So there's a team at Google called the Blue Dot team, and you know, it's quite a you know, moderate size team, and their job is to make the blue dot as accurate as it can possibly be and try and you know give pinpoint precision to locating where everybody's but but what happens if I'm not?

Speaker 1:

Because I don't need Google Maps to get to the gym. I know where it is. Is it is it still it's running in the background if I unless I've agreed only to share my location when I'm using the app?

Speaker:

Yeah, that's right. So you've probably said always allow.

Speaker 1:

I try and set it off for apps unless I need to, because I know that it's tracking, but I mean it's you can't have maps without location on.

Speaker:

And look, you know, things like Instagram, for example, maybe shifting to a different company, Meta, you know, it's it's convenient to have the the location on because it it does do a better job of of serving content to you when it knows where you are. It also makes it easier for you when you're posting something on Instagram to select the location that you're coming up.

Speaker 1:

Well, that's clever that they've done that because everyone loves to post an Instagram story with the location or I guess a grid post. And when I've turned it off on Instagram, it's annoying because then you you have to turn it on every time. So they've they've kind of hooked you into sharing where you are based on that.

Speaker:

And um, you know, lots of apps do it. You're owning up to where you are, and so there's a lot of intent that comes from that. So if you zoom out, let's go back to Google for a second. If you think about sort of the Google ad ecosystem, you know, so when you run a search on Google, you get back ads at the top of the search results page. But number one, not unsurprisingly, perhaps the number one, if you type in surfboard, it's gonna serve you ads that are about surfboards. So that's the number one clue. The number two clue is location. That's the number two thing that Google uses in their ad ecosystem. And so, you know, Google cares a lot about Google Maps and building a great product that we all love and invest an enormous amount of money into that product, you don't pay for that product, but in return, you're giving Google your location, and that location is used to improve the ads that they serve to you. So you can learn a lot about a user from their habits. So they know you go to the gym every day, they can figure out a new intent that you have, you know, suddenly you're in a surf shop, you haven't been in surf shops before, you're in a new location that you haven't been in. I mean, you can imagine all the things that you can distill from a person's location if you have it all the time that they have their smartphone, which frankly is pretty much all the time that people are on the move these days.

Speaker 1:

Okay, so the Google example is good because you've linked Google Maps to Google search result. But for me, the big one is Instagram. And I guess maybe I guess TikTok is the same for people, or maybe Facebook if you're using it. Like, and is that that's because of Meta, right?

Speaker:

Yeah, that's right. And the meta, you know, the meta ecosystems are great too, right? So if you think about something like Facebook or Instagram, there's a lot of intent data coming from that, from how you use it.

Speaker 1:

And should we just break down for the use? So Meta is maybe not the biggest tech company in the world, but definitely top five, top ten. Yep. And they own Facebook, Instagram, WhatsApp, which is a big one. What else have they got?

Speaker:

They're they're the three primary, you know, consumer-facing products.

Speaker 1:

For me, it's WhatsApp. Like if you know what I'm not that it's particularly interesting, but it's probably stuff like, oh, I'm at the gym again. Or like, you know, I'll meet you when I'm home from the gym, or Yeah, Hannah's arriving tomorrow.

Speaker:

We need to buy a present for Bob, you know, all of these kinds of things, right? So you're you're providing lots and lots of context, whether through your use of Instagram, through your use of Facebook, through your use of WhatsApp. So that's a that's a big signal, and you can figure out a lot about people's intent by their location, right? So I don't have to say out loud, or maybe I've said out loud in my house, oh, I'm I really want to buy a new surfboard, and then suddenly I'm getting surfboard ads. It could also be because I searched for surfboards, or I went to a surf shop. It's not that Siri or Alexa or Google Home is listening.

Speaker 1:

Yeah, we're we're putting out signals, right? That's the big thing. And and we're putting out signals in what I'm texting you or WhatsApping you, what I'm searching for. And then they're like, it's like how I'm scrolling through Instagram, right?

Speaker:

Yeah, absolutely. Absolutely. If you think about any feed-based tool, right, there's lots of clues you're giving it by interacting with the feed, right? The most obvious one is if you tap on something, right? So if you tap on something and expand the picture, you're you're showing pretty strong intent that that's something of interest to you. If you scroll through the images, even more intent, right? Like, hey, I'm super engaged with this. So you're giving that signal back. If you skip something really fast, it's a great signal, right? I'm not interested in that. If I pause on something, watch, you know, more than a reasonable amount of a video, tells it something. If you like it. If you like it, obviously massive signal. Yeah, yeah. If you share it with somebody else, huge signal and reshare something. And message it to somebody else, you know, all of these kinds of things. So you can imagine working on that team and building a model that understands people's intent based on positive things they do and also negative one of.

Speaker 1:

Things I learned early on in my career was this whole difference between reported behavior and actual behavior. Like we can say that we do XYZ or eat XYZ, and actually, like that's not how we tend to behave. We tend to report aspirational behavior. But my Instagram scrolling, that'll reveal all actual intent. Yeah, yeah, yeah.

Speaker:

That's it. Absolutely. And and sometimes, you know, these companies know what you're interested in before you even do, based on your behavior. The other thing that, of course, your smartphone knows is its IP address, which is, you know, basically the internet connection that it's it's attached to right now, right? So you're in my house right now in the podcast studio, you're connected to my Wi-Fi. My Wi-Fi has a you know is connected to a cable modem. That cable modem has a unique identifier that's a lot like a phone number. Obviously, I'm also connected to that. Your husband's connected to that, my wife's connected to that, right? And so you've got other people in the house doing things effectively on the same phone number, on the same IP address. And so that's a pretty interesting clue as well, right? So if somebody in the family gets interested in surfboards, so suddenly, you know, I'm interested in surfboards, there's a fair chance that maybe you'd be interested in seeing a couple of surfboards as well. And probably a better example might be something for the home. I'm shopping for a car. Uh might be a good idea to serve my wife pictures of cars because that's probably a family decision. So there's lots of signals that are coming out of these devices that are really helping Target. And uh, you know, it might put something in front of you that you've only just started to think about, or maybe you mentioned, and uh, and you know, you might come to the erroneous conclusion that the device was listening. But the reality is, you know, there's a lot more signal coming out of your smartphone than probably, you know, you'd get out of recording your voice.

Speaker 1:

So here's a question. One of my biggest actions on Instagram is like sharing reels with probably like three to five people on repeat. You know who you are if you're one of them. But so sometimes what they're searching or what they're signaling comes up on like my feed. So a really good friend is like, oh, I'm I'm really getting into cold water swimming. And then I'll be like, I have been getting served cold water swimming content. Do you think it it knows that we're a very tight user group and that's oh my gosh, it's crazy.

Speaker:

And if you think back to, you know, look, I don't know what was in Mark Zuckerberg's head when he built Facebook, but one of the great innovations of Facebook was first of all, you had to be a real person. When you sign up, we have to you have to be a real human. So we're identifying you as an as an individual, which is unlike Twitter that's now become X, right? Where you could sign up with multiple accounts, make up fake names, whatever it is. So there's actual real humans was sort of the beginning of Facebook. And then there was the friend graph. You made friends with people, and those friends were friends with other people, and and so suddenly you had a graph of how the world is related to the world. Yeah, right. And you'd even you'd even say, This is my wife, this is my husband, it's my brother, you know, my relationship is complicated. You're telling it all kinds of things about how that graph works. You put people in lists called close friends, all those kinds of things.

Speaker 1:

Yeah, the top eight.

unknown:

Yeah.

Speaker:

And you know, you interact with their posts and you see their posts. And so basically you're you're giving the company you know incredible information about how you're related to other people. And obviously, you know, you can be related through different paths. If you're related through lots of different paths, then probably you're pretty close. Yeah, yeah. You engage with their posts and eventually you're able to infer things like if they're interested in it, then probably you are as well. And again, remember location's a thing too. So that probably knows that you go already go to the gym with this person and now suggesting open water swimming to both of you, right?

Speaker 1:

So there's nothing more jarring than seeing someone else's explore page on Instagram. Like if I opened yours or you opened it, you'd probably be really disappointed if you open mine. It's just like fitness content and videos of dogs or something. But yeah, they're they're so tailored and it becomes so real to you. It's like your whole world and your whole network. But you pick two different people from two walks of life, then they'll have potentially super different interests and be pulling different content down from the algorithms.

Speaker:

Yeah, absolutely. If you think about these these companies too, you know, like Google's ad business is an incredible business, right? That's first of all built on the keywords you're searching for and secondly on where you are. Metas is a much, much more effective ads business per dollar spend because they've got this graph of who you're connected to. They've got lots of engagement data of you just using this thing for hours and hours a day, interacting with things, not interacting with things. They've got the graph how you're connected to people, all of that.

Speaker 1:

It's like the world's biggest user group study that's going on at all times, everywhere.

Speaker:

Yep, absolutely. And so it doesn't need to listen to uh to what you're saying. It can probably do a better job than that.

Speaker 1:

Yeah, it definitely can. What am I saying out loud versus what am I mindlessly scrolling? Yep. And Facebook less so, but WhatsApp, like these are apps I'm using more than I'd care to admit, and they're free. I think I heard once on a documentary, it was like, if you're not paying for a product, you are the product and someone else is basically paying for you.

Speaker:

Yeah, I guess that's one way to think about it. I mean, the other way to think about it is look, it's a straight trade here, right? You get amazing apps in your pocket, things like Google Maps or Instagram that, you know, help you navigate the world, provide you with enjoyment, connect you to your friends. You get those for free. And so in return, you're agreeing to give these companies lots and lots of data that they can use to advertise things to you and you know, help advertisers sell their products. And also, you know, you're giving it data that can help it do a better job of advertising to your friends, your family, everybody that you're connected with. So it's an incredible ad network and probably one of the best ad products that's ever been built.

Speaker 1:

Well, I and I controversially love it because it helps me find the content and the products that I want to buy. Um, sometimes I get carried away. And controversially, like if Amazon was listening in my home, I mean, I'm not a very interesting person, and so they wouldn't be getting much out of me. And maybe if they were listening into someone else's home, they'd discover criminal activity and disrupt it. So I'm like pretty easygoing on privacy. You'll probably tell me why that's a bad approach when we finish recording.

Speaker:

Well, I just say we're all past the point of no return, perhaps. Yeah, like like we're all we're all using these products and we're using them an enormous amount of the time every day and giving these companies a huge amount of data. And so, you know, whether or not we let them record the occasional sentence that we say or whatever it is is, you know, maybe by the by at this point. And you know, maybe that's a maybe it's a sad situation we've let the world get into, but the reality is, you know, you you're giving these companies pretty much everything that uh that they could possibly know about you, and they probably just don't need a voice at this point.

Speaker 1:

No, that's my biggest takeaway of this episode so far is like what I say out loud in conversation at home is actually much less useful than what you hover over as you scroll down your Instagram feed. Yeah. And there's no way of how could I cut that feed if I really decided I had a problem with it? I mean, Instagram I could I could give up and probably read a few more books if I did. And I don't really use Facebook. I maybe write down a few people's birthdays or something and probably just get rid of that app. But WhatsApp's a tricky one because that's probably my most used and the one that I would pay a subscription to use WhatsApp because I use it to I use it to talk to you, I use it talk to everyone all the time at all times, right?

Speaker:

Yeah, I'd say it's less of an Australian thing for um probably for our Australian listeners. I'd say there's a lot more folks use iMessages.

Speaker 1:

That's like in the US, they find it really cute that we all use. Yeah, yeah.

Speaker:

But I'd say for you know, Europe, South America, um, probably Africa, I'd say WhatsApp's probably dominant, being a bit different in different jurisdictions. But you know, the point stands, right? Is that these that these have become a primary way that that people communicate. I know when I lived in the US, nobody called me and I never called anybody. I mean, we just messaged the whole time. I'd say, I'd say Australians pick up the phone a little bit more, but but still, you know, the primary way of communicating is is using texting of some sort, right?

Speaker 1:

And so the the takeaway here is we're feeding it's it's called metadata, right? Metadata is also just like the whole of all the data that we're pumping into these apps at all the time, right?

Speaker:

Yeah, look, probably the definition of metadata is to say it's it's you know additional information that's added in addition to the actual core data that you're sending. So let's imagine, for example, that um I type in a search into Google, the data would be the search, and the metadata would be things like where I'm located, the time of day, how fast I type the words, you know, these kinds of things. Oh my god. That'd be the additional data.

Speaker 1:

It will know what I'm doing as soon as I wake up. That is crazy. I I wonder if if you could pay for like a report back. Everyone obviously likes to think about themselves, myself included, but you could get that back. Kind of like how you on the Amazon app, you can like get all of your prompts back. If I could get the sum total of like Instagram, WhatsApp, et cetera, like meta could give me a breakdown of like my habits and like where I can improve, and like you need to not get up and open Instagram, like you need to be getting up and getting out of bed, or something like that or you spend a lot more time on fitness videos than you see seem to be doing at the gym or whatever. But that they could, I mean, people would probably buy that, right?

Speaker:

There's some probably some product managers, you know, have thought about those kinds of things, and I guess that's the basis of companies like Whoop, um, you know, Garmin's features, those kinds of things really is trying to change people's habits based on, again, collecting an enormous amount of data about people and metadata about people.

Speaker 1:

We should do more on on like fitness data later in this series, but um, it also makes me think of Spotify Wrapped, which has been going for years now, and everyone loves to. I mean, mine always just says that I listen to Tailswift. I'm so basic. Um, but they also this year, I don't know if it was there's something about this year, every app was doing a rapt, yeah. Which is it's kind of cool, although some of them were a little bit of a stretch, but it's that whole idea of like, here's you played back to you. And it's I think no one really cares about your Spotify wrapped other than you, and yeah, everyone's like pushing it out onto their Instagram or whatever.

Speaker:

Yeah, absolutely. And uh the folks at OpenAI build chat GPT did one too.

Speaker 1:

Oh, yeah, that's right.

Speaker:

Yeah, I think I made the mistake of um sharing the screen with you as I looked at it for the first time and probably learned more about me than you than you wanted to know.

Speaker 1:

But uh I learned that Hugh likes protein recipes but doesn't like eggs for breakfast. Is that right?

Speaker:

Yeah, that's exactly right. Yeah, yeah, yeah.

Speaker 1:

I need to look at mine. It's that's the thing that we'll just be like my chat GPT is gonna tell me that I'm like a hypochondriac. I'm just always Googling. Medicol. Yeah, okay. So much medical chat. I've revealed a lot about myself on this episode already. So thinking about Spotify, do they have, do you think they have my wrapped from like 2018, or do they just get rid of that data after a year or something?

Speaker:

So it really depends on the jurisdiction as to what how long you can keep the data and and what data you can keep. Um, but I'd say, you know, if you work at one of these large tech companies, you know, you want as much data as you can possibly get your hands on. I mean, I've been in this situation when I worked at Microsoft. I mean, I wanted to keep every single query that every user had ever typed associated with that user and all of their behavioral data of what they clicked on and what they didn't click on. And the reason I wanted to keep that's not so I can study the user or particularly know anything about a particular user, but it's because if you train with all of that data and build an AI model, um, you can have a better search experience for future users if you've got more data.

Speaker 1:

Okay, so it's the aggregate that's what they're really after.

Speaker:

Yeah. So the more data you've got, the better your search algorithms will be, or your Instagram feed algorithms or whatever it is. So all of these tech companies want to keep all of the data forever, but you know, different jurisdictions place different constraints on the organizations. And the way the world works, at least at the moment, is that the EU probably has the strictest privacy standards. I think, you know, you typically see the EU move first and say, hey, we don't want you to keep data for more than this period of time, or we want it to be more anonymized, or we want to give the users more options, more control. The tech companies, I guess, come to the table and try and negotiate a position that suits them. But again, in their minds, they want to keep as much data as they possibly can for as long as they can.

Speaker 1:

But it's interesting that you say, for people listening who are getting freaked out by this episode, it's really not about what I, Hannah, or you, Hugh are doing. It's about the collective habits of a number of people over a long amount of time.

Speaker:

Yeah, exactly. And it's particularly important for what we call in the tale. So, you know, it's pretty easy to figure out things that are popular and serve popular things to people, right? So if there's a, I don't know, popular pair of shorts right now that lots of people look like.

Speaker 1:

They don't need to put it on. Like, that's like Marks and Spencer's who we did a an episode on last season about their hack, but like one of the things that really annoys me that they do is they push a product and I love it. And then I click through and I can't buy it because it's sold out. So it's like, don't push me the stuff that's selling well, because then I'll just get frustrated when I click through and it's sold out.

Speaker:

Yeah, yeah, that's definitely a problem. Though a lot of the companies do push things that are sold out because it helps them get a signal about how many users are still interested. So when they do their next production run, they know how many to make. So having worked in a fashion company, I know that pushing things that are actually sold out is a good way to get a signal of how it's gonna be. Scarcity, okay.

Speaker 1:

Yeah, it makes me want it more as well. Because I'm like, oh, I knew it was nice, and everyone else thought it was nice too.

Speaker:

Yeah. Back to the EU, right? So the EU wants to protect user privacy, wants to give users more control. You know, the tech companies come to the table and they try and advocate a position that suits them. But what's in their mind, of course, is we want to keep the data for as long as we can and as much as we possibly can. And there's a bit of back and forth with, you know, different EU privacy commissioners or whoever it is. You go back and forth, eventually you reach some agreement. What typically then happens is that the US kind of fast follows, maybe a couple of years later.

Speaker 1:

Even under this administration, you think they would?

Speaker:

Well, I'm not sure. I mean, I I I think, you know, historically that's been the case. I guess we'll we'll know in a we'll know in a year or two how the current administration, you know, thinks about these things. But I'd say in general, the EU is the trendsetter. You know, you've got places like Australia who are pretty aggressive, but the EU tends to set some precedent and then eventually everybody follows. So you find the tech companies working pretty closely, pretty aggressively with the EU to try and advocate their position.

Speaker 1:

And even China, like China's obviously a little bit more lax on that. They probably heard this me say.

Speaker:

I'd say, uh, I'd say that's just a completely different space. You know, the the EU, the UK, Australia, the US, you know, have sort of rules-based systems where they create laws about what these companies can and can't do. I'd say in China, you know, largely the government does what the government wants, and largely the government has access to most of the most of the data that most of the companies have there. So it's quite a different sort of environment. I want to just finish off that point about the head and the tail and things, right? So, you know, I gave a bad example of saying, look, you know, it's not that hard to figure out what shorts to show you, you know, we know where you've been, what the weather's like, the kinds of things that you're searching for. So it's not a hard problem. What's a hard problem is is when you've got that sort of one-off, really detailed requirement that you've never had in your life before, that not too many people have. So, you know, imagine I know you don't own a car, but you know, imagine you had a car, it's an old car, you're having problems with some component in the car, you know, the cassette radio is not working, and you want to buy a new knob for the cassette radio, right? So you go into Google and you type, you know, cassette radio, Nissan Pulsar 1983 knob. That's a pretty obscure query, but there's gonna be some user somewhere in the history of time who searched for exactly that query or something very, very close. Maybe they search for the 1982 model and they've clicked on some results and they've skipped some results. If we're able to keep that data, then we can do a pretty good job of answering that query for you because we know what's a good answer and what's not. But you need a lot of data to answer a query like that to train a model and deliver great answers. Because people aren't searching it that much. That's right. So it's very, very rare. You know, that's one of the things that makes Google an incredibly successful company is that they've got lots of data and they're able to answer the hard queries that you really care about, whether it's about, you know, buying an obscure part for your car or some medical condition that, you know, a friend's got that's really obscure. Um, you need lots of data for that. And so that's one of the reasons that in the ads business, one of the reasons why these companies keep a lot of data.

Speaker 1:

The revenue that a company like Meta is that must be like the lion's share or at least the majority of their revenue income, right?

Speaker:

Maybe we should we could we should break this down in a future episode, but we should talk ads one day and sort of exactly how they work. But but yeah, absolutely. I mean, these are you know, Google and Meta, who own Facebook and Instagram, have two of the best businesses that have ever been built because their ads businesses are are incredible. They do a great job of serving content to you that perhaps you don't even know you want. You click, you make the advertisers happy.

Speaker 1:

You hear people say it like, oh, I love that jumper. Oh yeah, Instagram served it to me.

Speaker:

Yeah.

Speaker 1:

And then they use the money they get from the ads to pump in to make the product even better, so you keep using it. Yeah.

Speaker:

Yeah, yeah.

Speaker 1:

It's pretty obvious when you think about it. And to be honest, like I've said a few times, I don't really mind. I find it to be quite helpful in the modern world that we live in, but that's probably one fairly lax viewpoint on a spectrum where other people would feel very differently.

Speaker:

Yeah, absolutely.

Speaker 1:

Can I ask a question then? So I know that companies aren't interested in necessarily you as an individual, but like if I was a criminal and the government wanted to like, or the police or whomever wanted to know about my habit, can they get that from the tech companies? Can they like is that called subpoenaing? Is that is can they just say you need a handover so we know what they've been doing?

Speaker:

Yeah, the companies are approached by law enforcement all the time, you know, asked to provide data of different types and they have a lot of processes that they have to follow. Again, they have to operate within the law. The law enforcement organizations have to provide the right warrants or whatever it is. But yeah, absolutely, you know, this data is often used to solve crimes, um, prosecute people, whatever else it is. And I guess it's you know it's similar to the old days with the paper version, you know, where somebody knock on your door with a warrant and say, We're gonna we're gonna come in and take your office away in a truck and look through all of your papers. So there's a digital version of that as well.

Speaker 1:

Would a company like Meta ever have flags that meant that they would want to turn someone over to law enforcement?

Speaker:

Um no, no, I don't think so. I'd say, look, these companies don't think they're the police.

Speaker 1:

Yeah, yeah.

Speaker:

They leave they leave law enforcement to law enforcement. Obviously, they have to make sure that illegal things don't happen on their platforms within whatever the legal constraints are. So, you know, they have to prevent, you know, people selling weapons online or whatever else it is. So there's all sorts of, you know, interesting legal constraints around these companies, and they have to put in their best effort to do that.

Speaker 1:

Okay, but there there are examples of some stuff being done for the greater good, not just the greater good of whoever I'm buying my workout clothes from. But uh, I have a friend who worked in marketing for a cancer charity, and she was telling me about how they did like a collab with Boots, which is the biggest pharmacy in the UK, and they were able to discern from people's shopping habits that they needed to be checked for symptoms before they even realized there was a problem themselves, based on like what medication they were buying to alleviate symptoms.

Speaker:

Yeah, absolutely. And again, you know, you've got all this data about people's behavior, you know, when they wake up, what they do, where they go, the things they search for. I mean, you can use that data to to figure out all kinds of things about people, including medical conditions. In fact, um, I don't know if I told you this story, but when I worked at Microsoft, early days of Microsoft working on search, it's probably about, I don't know, let's call it 2007. I was really interested in queries and clicks. So what I mean by that is when somebody runs a query, what do they what do they click on? And how do you use that click data to inform, you know, better answers for future customers? So I was really, really interested in this stuff. And um, I thought as an interesting experiment, we had a list of data. I thought I'd look for people who were searching for particular drugs, you know, prescription drugs that that you can that you can legally buy. And then I'd have a look at the symptoms that they'd search for in the same session or in other sessions, right? So let's imagine you buy some amino flames.

Speaker 1:

Welatonin is a great example. We were chatting this morning and my husband was probably yeah, and he was like, I took some melatonin on the plane, and then I've got some chest pain, and then I Googled it, and it turns out that that can be a side effect. So good news, I don't have a chat, a heart problem, but I am not I'm gonna stop taking the melatonin from my jet lag.

Speaker:

Yeah, absolutely. Great example. Or, you know, you take anti-inflammatories and you get stomach crank, yeah. Yeah, and so I basically looked for some brand name drugs, and then I looked for symptoms, you know, write some simple code to go and figure these things out. And then I thought, I'm gonna go and look up the FDA documentation and try and figure out if these are acknowledged side effects. And of course, yeah, most of them were, right? So you know, figure out people who search for a particular brand of anti-inflammatories would later search for stomach cramps. You say, yeah, well, that's a common side effect of these things. But I also found some side effects or what could have been side effects that weren't yet recognized side effects. You know, I said to my boss at the time, um, I bet these really are side effects of these drugs because so many people who are searching for these drugs are also searching for these symptoms, right? So if you've got millions and millions of people searching for a particular drug and then searching for a side effect, probably a side effect.

Speaker 1:

Wow. Okay, so you knew it before I guess because you can only trial a certain sample, which I'm sure is extensive for safety, but actually, like I said up front, like this is the biggest live user trial happening at all times. And okay, so would they ever, would Google ever like sell that back to the drug companies?

Speaker:

I wouldn't think so. No, okay. I wouldn't think so. You know, that's not their line of business, right? But I think I guess it just speaks to the fact that if you can keep a lot of this data, keep it associated with a user, you've got enough users, millions of users, billions of users in some cases, you can figure out these kinds. of things in a way that you know no clinical trial will ever figure out.

Speaker 1:

Yeah. Or like it will, if I search a brand, I'll get similar brands that aren't the brand I searched for on my ads, which is again super useful. Because it's like, oh, we hear you and now we're serving you something similar based on what other people were interested in.

Speaker:

The Instagram ads team's going to love you. Like uh because I love them. Yeah. And then and if you work on an ads team, that's what you want, right? Is you you don't want to be the annoying, you know, TV ad that's shown 10 times every hour in the middle of a sports event that everybody's sick of. You want to be an ad that's highly relevant, highly targeted, timely, and exactly what you're looking for. It's on the story and that's what you want to be.

Speaker 1:

That's what I want. I want on the stories when I can either flip past it or engage with it. Instagram ad team if you're listening, I don't like when you serve me ads on the feed. So if you could just sort that out, that'd be great since I've given you so much good promo. Awesome. Okay, so before we wrap up, I have revealed quite a lot about myself to you and the listeners. I would love to know who are do you have you use Instagram or not so much? Like what are your targeted ad groups?

Speaker:

Oh that's a great question. I would say I follow a lot of musicians uh on Instagram and so I get a lot of ads for you know vinyl records that reissues of things, you know, bands I'm interested in. So I'd say look, you know, I can when I do go use Instagram, I'd say the ads are highly relevant to me. But I'd say I'm kind of disciplined with Instagram. I um I actually only use it when I'm traveling. So whenever I'm traveling I try and post one picture a day sort of make a visual record of the trip that I'm on and then I delete the app and wait till my next trip.

Speaker 1:

Guys Hugh is more disciplined and clearly much cooler than me as well. Oh not sure final records and live meetings.

Speaker:

But uh we can talk about my use of LinkedIn which is extensive and often so I wouldn't say I'm social media free but I probably uh you know hang around more on the professional networks which makes me a lot less cool than you actually well swings and roundabouts I guess but I've revealed myself to be incredibly basic.

Speaker 1:

So in summary don't worry about Alexa or Google or your phone listening to you but maybe think about how much other data you're giving them via pretty much everything else you're doing on your phone.

Speaker:

Yep absolutely Hannah so they're really not listening. Alexa's not listening Siri's not listening Google Home is not listening but you're probably giving more data and more metadata than you realize with things like your search queries, your use of LLMs, scrolling through feeds, your location is a huge factor and you know you're handing all of these companies all of your behavior and your family's behavior and your friends' behavior and that's how they they can do such a great job of building the businesses that they've built.

Speaker 1:

But I like how you framed it when you described it as like it's an exchange things like Google Maps, WhatsApp, those have drastically improved my day-to-day life and so them extracting my data back seems fair enough, but it'll be interesting to hear back from the listeners on how they sit on that specific point.

Speaker:

Yeah, absolutely. It's been great to be back in the chair Hannah welcome to season two this is great fun.

Speaker 1:

Season two I've learned an incredible amount already and I'm sorry that I revealed so much about myself to listeners but uh it gives you a sense of who we are.

Speaker:

Yeah and if you're excited about the Tech Overflow podcast you can find us at techoverflowpodcast.com. We're also on the socials.

Speaker 1:

We're on Instagram indeed and X and Hugh's favorite LinkedIn you can also find us wherever you get your podcast. Please please like subscribe review and share with your friends colleagues family.

Speaker:

We're really excited about season two and we want to help as many people learn about tech as we can we do indeed so thanks to all the curious people who've joined us today and we'll see you soon for episode two. We'll see you next time