sds podcast episode 349: human-in-the- loop …€¦ · human-in-the-loop algorithms in retail ....
TRANSCRIPT
SDS PODCAST
EPISODE 349:
HUMAN-IN-THE-
LOOP ALGORITHMS
IN RETAIL
Kirill Eremenko: This is episode number 349 with Chief Algorithms
Officer at Stitch Fix, Brad Klingenberg.
Kirill Eremenko: Welcome to the SuperDataScience podcast. My name
is Kirill Eremenko, Data Science Coach and Lifestyle
Entrepreneur. And each week we bring you inspiring
people and ideas to help you build your successful
career in data science. Thanks for being here today
and now let's make the complex simple.
Kirill Eremenko: This episode is brought to you by SuperDataScience,
our online membership platform for learning data
science at any level. We've got over 2,500 video
tutorials, over 200 hours of content, and 30 plus
courses with new courses being added on average once
per month. So all of that and more, you get as part of
your membership at SuperDataScience. So don't hold
off. Sign up today at www.superdatascience.com.
Secure your membership and take your data science
skills to the next level.
Kirill Eremenko: Welcome back to the SuperDataScience podcast,
everybody, super excited to have you back here on the
show. And I've got a question for you. What do you
think Netflix, Amazon, Spotify and Uber all have in
common? Oh, that's right. They're all algorithms
companies. And right now, according to Forbes, we're
living in the golden age of algorithms. This is a time
when companies that are based on algorithms that use
algorithms, not just data science and data analytics,
but specifically algorithms, those companies are the
ones that are going to thrive. And today's guest, Brad
Klingenberg, is the Chief Algorithms Officer at Stitch
Fix.
Kirill Eremenko: So let me tell you a bit about Stitch Fix. Stitch Fix is
an online personal styling service. They were
established in 2011, so just a mere nine years ago. But
in that time, they've already accomplished so much. In
2017, they went public. As of February 2018, they
were valued at $2 billion. $2 billion. Right now, they
have over 3 million active users worldwide. And on
their algorithms team, which is headed by Brad, wait
for it, they have 125 people on that team. How cool is
that? And so in this episode, Brad walked us through
exactly what is happening at Stitch Fix, and how
they're using algorithms to their advantage. This
episode can roughly be broken down into three core
components. We'll be talking about algorithms quite a
lot, and you will hear about the sheer diversity of
applications of algorithms at Stitch Fix. It's mind-
blowing.
Kirill Eremenko: They use algorithms for everything from
personalization, to NLP to process texts, to computer
vision to process images, other models to solve the
traveling salesman problem. It's hard to imagine a
single activity in the business that is not underpinned
by an algorithm. So that's a very exciting component of
what we talked about. Another really cool thing that
you'll hear about is how Stitch Fix combines humans
and algorithms together. This is a company that is
completely destroying the myth that algorithms will
replace humans. In this company, algorithms work
side by side with humans, and you'll find out exactly
what that means, how they accomplished it, and how
people are satisfied with that process.
Kirill Eremenko: And finally, the third part is what will be interesting to
you if you're an executive or manager, you want to
lead a company at some point, it's about how to build
an algorithms-driven company. What does it take to
make sure that your business has algorithms in its
DNA? What are the trade offs, what are the
advantages, and what are the pitfalls to look out for?
In a nutshell, this is going to be an extremely
insightful episode about algorithms and algorithm-
driven companies. So without further ado, let's
welcome Brad Klingenberg, Chief Algorithms Officer at
Stitch Fix.
Kirill Eremenko: Welcome back to the Super Data Science podcast
ladies and gentlemen. Super excited to have you on
the show, and today's guest is Brad Klingenberg from
Stitch Fix, calling in from San Francisco. Brad, how
are you going today?
Brad Klingenber...: I'm doing quite well Kirill, thank you.
Kirill Eremenko: Very excited to have you on the show. It's been a while.
We originally planned this for several months ago and
then my schedule didn't work. Your schedule didn't
work. But finally we're here. We've going to be talking
about algorithms and Stitch Fix. Are you excited about
this?
Brad Klingenber...: Absolutely. It's a pleasure, and I'm glad we could align
the schedules and make it happen.
Kirill Eremenko: Yeah. Same here. And what I really appreciate, already
talking to you a little bit just before the podcast now,
you're so excited about the company you work for. You
talk about Stitch Fix, and we can feel it in the tone of
your voice. What is the one thing that you would say,
that makes you most excited about waking up in the
morning and going to work?
Brad Klingenber...: Oh, it's so hard to choose just one. So I might cheat
and say two. So Stitch Fix, just the problems that we
work on are just fascinating. And so I feel, after having
been here for more than six years now, I still learn and
get to think about new things every day, which is
exciting. And secondly, the people are great. So I think
that combination of working on interesting things,
with colleagues that you enjoy spending time with, is
pretty hard to beat, and keeps you engaged in the long
haul.
Kirill Eremenko: That's fantastic, to continue learning after six years.
Would you say your learning has shifted? Maybe at the
start was more about algorithms, and now it's about
leadership? Or is it still a combination of both?
Brad Klingenber...: I think it's a healthy combination of both. So my
journey at Stitch Fix, today I'm the Chief Algorithms
Officer, or lead our algorithms team which, which is
largely a data science and platform organization that is
focused on using data to improve our business. And so
I joined Stitch Fix about six and a half years ago. So
an individual contributor. And so certainly, it's taking
on people management and larger and larger teams.
The focus of my work has shifted. So there's definitely
a lot to learn there. But part of what's still so exciting
to be here, is we have these wonderful, difficult open-
ended problems around making our clients happier
and buying inventory. And there's really a richness of
this problem space where many of the fundamental
things we work on are just not solved. And that keeps
me engaged on a technical level as well as managerial
level.
Kirill Eremenko: That's, that's fantastic. And congratulations. This is a
first time I've heard the title Chief Algorithms Officer.
Did Stitch Fix invent this title, and what does the title
entail?
Brad Klingenber...: It's a great question. I can't make, with certainty, the
claim that it's unique. But it's certainly unusual. I
think what it reflects, as much as anything, is really
the unique role of the algorithms team in the
company. So data science is a really interesting field,
and you have different companies organized quite
differently. And very often you'll find data science or
machine learning and algorithm development is a sub-
function of engineering or other verticals at a
company. And Stitch Fix is somewhat unusual in
actually having an algorithms organization that is a
top-level department, alongside other functions of a
business. And I think this has been important
historically, and just generally represents the role of
data science at the company.
Kirill Eremenko: Yeah. It's a very cool thing that's going on now. I've
recently, well maybe a year ago, read a blog
somewhere, that before data-driven companies were
crushing it, but now it's the algorithm companies that
are crushing it. Like you look at Amazon, they have
their recommender system uplifts their revenue by
30%. You've got Google with their algorithm for
displaying ads. Uber with their matching algorithms.
And basically, all of the top companies are using
algorithms. And it's really cool to see a company in the
retail space, in the fashion space also following along
those lines. And would you say that this is something
common that your competitors are also using or is it
something a massive innovation that makes Stitch Fix
stand out?
Brad Klingenber...: I think certainly across technologies, the examples you
cited at a Google or Amazon, there are many places
where algorithms are used in core parts of the
business. That's fairly unusual, certainly in fashion
and retail. I think a lot of that has to do with, as
much, with the aim of the company, as what the data
that the companies have. And so one of the advantages
of the Stitch Fix model, which is, Stitch Fix's personal
styling service, a retailer, it helps people find what they
love. But not in the traditional sense of shopping. So,
we're not a store where you browse or filter your way
to inventory. It's really a service where a personal
stylist actually can make decisions, and we bet on
those recommendations by sending them to clients.
Brad Klingenber...: And as part of that, we close a feedback loop that's
unavailable to many retail models. So I'm sure if you
think about the last time you were trying something on
in a store, and it didn't quite fit, or you didn't quite the
style and you left behind in the dressing room,
chances are extremely good that nobody learned
anything about you or anything about the inventory.
In Stitch Fix, part of the magic of the model is our
clients are generally very excited to share feedback
with us. And so when we send things to clients, we get
to learn about what they and what they don't like.
That helps us better serve them, but also better
understand how to use inventory to make other clients
happy.
Brad Klingenber...: And so the algorithms we use at Stitch Fix, since the
top, this feedback loop that really opens a lot of
opportunities to use data, not just in a sense of
building a dashboard that is providing the insight into
something, but in a very active sense, where we can
actually use data to drive to recommendation
algorithms at the heart of the business, to make
decisions about managing inventory or marketing. And
so, I do think it's a bit of a part of a vanguard data
being used in a much more active way, in many
different businesses that I'd expect to see continue to
play out in the coming decades in other industries as
well.
Kirill Eremenko: Yeah, definitely great that you're leading the way in
that sense. You mentioned a bit about Stitch Fix,
about how you've closed the feedback loop and that
you're in the fashion and design industry. Could you
tell us a bit more about the company? So for somebody
who's never heard of Stitch Fix or never used the
service, how does it all work? How does it all play out?
Like for instance, if I want to go and buy something,
some item of clothing, could you walk us through the
customer experience? So you paint a bit of a better
picture of how the service works before we dive into
more, the algorithms and that side of things?
Brad Klingenber...: Yeah, absolutely. So Stitch Fix is a personal styling
service. The way it works from a client perspective is
you come to us and tell us a bit about your
preferences. So how much you typically to spend, how
you like your clothes to fit, and a variety of things to
help us get to better understand your sense of style.
And then with that, we'll actually pick some things to
send to you. So, it's unlike a traditional retailer or even
most e-commerce models, we're actually making
choices on the behalf of clients. So, rather than just
for yourself, Stitch Fix is actually making the choice of
what to send to you, and being a personal stylist, in
the truest sense.
Brad Klingenber...: And so of course, it's quite important to get that right.
And that's where the focus on using data to better
understand the preferences of our clients, and the
inventory that they're going to love becomes so
important. And so as a client, you'll get a shipment
that we call a fix, and have a few days to try things on
at home. Try it on, with your mirror, with other things
that you own. You keep what you love, and send back
the rest, and you are able to continuously engage over
time in that way.
Brad Klingenber...: More recently, we've also opened up some new ways of
engaging that are quite exciting. And so for example,
being able to get recommendations for an outfit that
goes with things that you've purchased with us in the
past. And so whereas we're really exploring more ways
to make it an even richer experience, but I think the
key distinction from traditional retail is really the
company acting as a personal stylist, and us making a
really literal bet on our recommendations.
Kirill Eremenko: So what kind of payment model is... Like for instance,
if I like an item, do I need to pay for it after you've
shipped it to me? Or is it a membership fee that I pay
every month? How is this monetized? What is the
revenue model for the company?
Brad Klingenber...: So the way it works is that you, as a client, would pay
$20 for effects as a styling fee. And then, if you keep
anything from your fix, that's applied toward what you
keep. And then if you keep five things, you actually get
a discount beyond that. And so you can look at the
styling fee essentially, is only something that you end
up paying if you don't keep anything. So if you keep
anything that we send you, it's applied toward that.
And then that model includes both the shipping to you
and any returns that you'd to make to the company.
Kirill Eremenko: Okay. Got you. And so that's what you mean when you
say, you close the feedback loop, that by seeing what
the client returned, you're able to adjust your
algorithm. So even if we take for instance, the best
retailer who takes care of all of their data possible, the
only data they actually see is... They don't see what
the client is trying on, but they see what the client is
purchasing. And then from there they can say, "All
right, so the clients purchased these five pairs of
jeans, or this specific pair of jeans. So we can learn
from that." But you can learn from that, and in
addition, you get to learn from what they didn't like.
Brad Klingenber...: Indeed. So that's exactly right. Learning why you
didn't like something is often just as useful as learning
why you did like something. So for example, if
something doesn't quite fit you because it's too big or
too small, that's a lot of information about how you to
wear things. But beyond the implicit feedback of what
people keep and what they choose to return, our
clients can also share much richer feedback. So we
ask structured questions about, "How do you like the
size?" Or the style of things that are sent. And even
have clients share freeform texts feedback with us.
Brad Klingenber...: So part of the model is really this combination of
algorithmic decision making, but also then, expert
human curation. So every fix that we send is actually
curated by a stylist who's a Stitch Fix employee, and
they engage with clients on a variety of unstructured
ways. So when you request a fix, you can include a
free form requests note. And when you're providing
feedback on things that you've received, likewise, you
can include freeform text feedback on what you've
liked, what you haven't, to help the stylists and also
our algorithms get to know you even better.
Kirill Eremenko: So just to clarify, that means that as a client, my
stylist is not just an algorithm, it's an algorithm and a
human being.
Brad Klingenber...: Yes, absolutely. And so when we're styling effects,
which is our term for choosing things to send to a
client, there's really this combination of
recommendation algorithms, and the tools that our
stylists use. But ultimately in every fix, every single
item that we select is picked by a stylist. And it's really
this human-in-the-loop process, and human-in-the-
loop in a very strong sense. So not just having expert
humans label things so that algorithms can learn, but
actually having an active human decision making in
our production loop.
Kirill Eremenko: I like that term, active human-in-the-loop. Much better
than just labeling stuff. That's very cool. And so I think
we'll get to that in a bit, about how humans and
machines work together at Stitch Fix. I think that's a
very important topic. But for now, I want to ask you
just to put this into perspective for people who are
listening who might even have used the service before,
but don't know the intimate details of the company.
Two questions. How many people do you have on your
team? And also, how many clients does Stitch Fix
serve?
Brad Klingenber...: So the Algorithms team at Stitch Fix, which is the
team that I lead as Chief Algorithms Officer includes a
data scientists and platform engineers. And we have,
today, over 125 people on the team. And so, a very
large organization, or company size. [inaudible
00:18:01] a couple things. I think, both the incredibly
important role that the data science plays at the
company, and the breadth of applications. And so,
happy to chat about that more. And then, to the
second part of your question, we have over three
million active clients. So [crosstalk 00:18:22] a large
population as well.
Kirill Eremenko: That's a lot of clients. You have a lot of data. I love this
about algorithm-driven companies, that the better the
service, the more clients you'll have. The more clients
you'll have, the more data, meaning the better the
service will become. It's a self-fulfilling prophecy.
Brad Klingenber...: Oh, absolutely. And I can certainly remember six years
ago when I just joined the company, the volume of
shipments we were making was much smaller. And
that has a real impact on the amount of time it takes
to run experiments and the amount of signal that you
have to work with. And today, we're able to run
experiments, much more quickly than I'd ever dared
hope, years ago.
Kirill Eremenko: That's crazy. When you joined the company, how big
was the team then?
Brad Klingenber...: So, the data science team was led by Eric Colson, my
predecessor in the Chief Algorithms Officer role. And I
joined as one of the first data scientists. And so I think
Eric and maybe three other people, depending on how
you count, something like that.
Kirill Eremenko: Wow, that's crazy. Okay. Very interesting. So that's the
scale of the company. Massive company. 3 million
active users. Over 125 people in the Algorithms team.
So let's dive into a little bit. Let's dissect this, the types
of algorithms you use. I think this could be a very cool
case study for somebody not necessarily even just in
the retail space, but who wants to see how AI and
algorithm can drive a company, and it can be at the
foundation of a business. How it's already possible,
even at the start of the 2020s. Oh, well, it's been
possible even in the previous decade you've been doing
for a while.
Kirill Eremenko: So the first question I would have is, there's two types
of filtering that are quite well known. Collaborative
filtering versus content-based filtering. In your
algorithms tour, which by the way, I recommend for
everybody listening to this, I will link to it in the show
notes. It's algorithms-tour.stitchfix.com amazing.
Completely blew my mind. I loved reading through it.
Such a volume of information and so diverse. It really
explains how this whole thing runs.
Kirill Eremenko: So I wanted to talk a little bit about some points in
this algorithms tour. So collaborative versus content-
based filtering. In the tour, it says that you use
collaborative filtering plus mixed effects modeling.
Could you go into a bit of detail on that? Why and how
would you use collaborative filtering plus mixed effects
modeling, and what are those things?
Brad Klingenber...: Yeah, absolutely. I think it's helpful, just as general
context, to think about the types of data that we have,
when we're thinking about what to send to people. And
broadly speaking, I like to think of there being three
categories. So there are things that clients tell us
about themselves. And that could be through the
initial onboarding survey where clients could tell us
the sizes they to wear and their budget, preferences. It
can also be things like, our clients are able to rate
images in our iOS app, and on the web, that help us
understand their style preferences. But generally
things we know about our clients.
Brad Klingenber...: On the other ends, Stitch Fix as a retailer, we buy and
hold inventory. And so as part of creating or buying
that inventory, we have a lot of information about it.
So that could include structured measurements, or
even subjective ideas about when you might want to
wear this, or different occasions it might be suited for.
And then finally, the third category is really feedback.
So as we send things to clients, learning about what
they like, in a way that helps us better serve them, and
helps us get to know them over time. But then also
helps us do a better job managing our inventory in a
way that creates better client outcomes.
Brad Klingenber...: And so, in general, in thinking about this
recommendation problem, you have those three
categories to draw from. And so, as you mentioned, a
content-based recommendations is something that
might be very focused on the things we know explicitly
about our clients, and things are know explicitly about
our inventory. And we know that there's quite a lot to
do there. And you mentioned the mixed effects model,
highlighted in the algorithms tour, is one example of a
type of algorithm, not unlike other applications where
you use things matrix factorizations where you're
essentially trying to learn either through a transaction
history or through feedback that people have provided,
representations of clients and inventory that are useful
for making future predictions. So not just something
that's an explicit attribute given by the client, or that
we know about our inventory.
Kirill Eremenko: So for instance, let's compare it to a very well known
company that has an algorithm in its base, Netflix,
right? Netflix is the, probably... It is the most valued
company in terms of movie production right now. And
I think it's actually been labeled the most valued
membership platform in the world. Anyway, the point
is, they use filtering, in terms of a content filtering.
They look at content-based filtering. They look at, "All
right, what movies have you watched, or what shows
have you watched? What are the shows similar to
those shows to recommend to you?" And then
collaborative filtering or... I don't know what they use.
It's proprietary information, but they could use
content-based filtering, plus collaborative filtering.
They could look at, "Okay, you've watched these
shows, and then this other person is similar to you in
these certain ways. So they've watched those similar
shows. What else have they watched? Now we can
recommend that too."
Kirill Eremenko: Whereas, what you can do is, you can do both those
things, but in addition to that, you can take it to the
next level. You actually have feedback. Your users tell
you implicitly, feedback, saying that by returning
items, you understand they don't those things. Plus, in
addition, they might fill in some surveys, or provide
your textual information. Maybe get on a call with your
stylist. Basically... Or explain attributes about
themselves. Basically, give you additional contextual
information that other companies such as Netflix
wouldn't have. And you're able to add that in. And
that's what is called the mixed effects modeling. Am I
on the right track here?
Brad Klingenber...: So the mixed effects modeling refers to a very
particular type of statistical model that it is useful, for
example, in learning empirically, that a... For example,
a particular blouse does better with clients of a certain
style preference than another. And so it's an example
of an algorithm that can make those kinds of empirical
inferences. One family of algorithm that we've used to
some success here is something called the
factorization machine, which is something closely
related to matrix factorization, but it makes it easy to
incorporate the types of explicit attributes that you've
mentioned.
Brad Klingenber...: And maybe to add, I think one, actually interesting
feature of the Stitch Fix recommendation problem that
makes it different from recommending movies in
Netflix, or music at Pandora or Spotify is, again, that
we have this human stylist in the loop. And so in
addition to learning through the choices our clients
make, and what to keep and what to return, and how
they rate things, we get to learn through the way that
stylists interact with our recommendations as well. So
being able to learn from what a stylist chose to send,
but what they didn't choose to send, that was
recommended by an algorithm. And so there's actually
this interesting dual feedback character to the
problem.
Kirill Eremenko: So basically, you're using pins or images that the
clients liked, in order to get additional insights, which
they cannot explicitly tell you, and add that into the
model. Tell us a bit about that. How does that work?
Brad Klingenber...: Yeah, absolutely. So one of the best examples of this
might be our style shuffle feature in our mobile app
and on the web, which lots of people rate different
images. And through those ratings, what we're eliciting
from clients is just their reaction to something, and
whether they like the style. And through looking at all
the many things that they share with us, then we're
able to derive some representations of their
preferences that are useful on many different parts of
the company.
Kirill Eremenko: Got you. And let's go into a little bit of technical details
here. I found a very cool example of how you deal with
this visual information. Because there's lots of ways
you could transfer visual information into machine
information. How does this whole system work, where
you create a vector for a specific style, or a specific
item even? Like you have vectors representing different
items. And then based on the cosine similarity,
basically the angle between the vectors, you can tell if
two items are similar or not. Is this a common
approach to encoding images for further comparison
and filtering? Or are there some innovations that you
can share with us, that you are able to create along
the way?
Brad Klingenber...: Yeah, it's a great question. I think... So, certainly,
recent years have brought a lot of incredible
improvements in computer vision generally. One of the
applications that we were just chatting about though,
it combines not just trying to process an image, but
really understand how a client reacted to it. And their
response to it. And so, one interesting aspect of our
business is because we have stylists who largely
interact with our inventory through software tools that
allow them to make accurate recommendations, is we
have great image assets of essentially all the inventory
that we carry. And by showing that to clients and
getting the reactions to it on a tool Style Shuffle, we're
able to learn an embedding or a vector representation
of the client preferences for that item, that are really
useful. And even simple applications as you noted by
looking at cosine similarities. And it's really the
application of those embeddings throughout the
business, that are really a differentiated application of
that data.
Brad Klingenber...: So for example, by playing Style Shuffle, we get to
learn about your visual style preferences, in a way that
leads to the really material improvements in the
recommendations that we make, and therefore the
outcomes that the clients have, how happy they are
with their fixes. And so it's really a very useful tool
that shows up in a lot of places, including helping to
pick a stylist for you in the first place. So helping to
choose from amongst the stylist, the good style you're
fixed, to find the ones that will be most successful.
Kirill Eremenko: So that's also another algorithm to not just pick your
clothes but pick a stylist in the first place. Right?
Brad Klingenber...: Yes, absolutely.
Kirill Eremenko: Wow. And what are the criteria there?
Brad Klingenber...: Well, there's a lot of things. I think part of the value
proposition for many clients, it's actually creating a
longterm relationship with a stylist who gets to know
you over time. And that free form interaction that I
mentioned earlier, being able to request things to your
stylist, and provide feedback, create [inaudible
00:30:06], in many cases, a real relationship. And so,
for many clients, they're happy to stick with a stylist
who's gotten to know them and really knows what they
love, and brings that human touch. Of course, for
clients who are getting their first fix, or looking for a
new stylist, there's a question of, "Well how do we find
that the stylist who you're going to like the most?"
Brad Klingenber...: And one of the important signals that goes into this is
really understanding the strengths and weaknesses
different stylists have, with respect to different styles.
So, we all have our own preferences, and you might be
better or worse at styling different types of clients. And
so we try, again using this idea of a vector
representation, or embedding representation of styles,
something we call [lasing 00:30:56] Style, truly use
that to understand the types of clients that stylists are
likely to be successful with.
Kirill Eremenko: I thought for a second, you said a vector
representations of stylists. That would be funny.
Brad Klingenber...: Yeah. At least the preferences that they share when
picking things for clients.
Kirill Eremenko: Yeah. Got you. Okay. All right. So you have algorithms
to find the right clothes. You have algorithms to match
customers to stylists. You probably have NLP to
process text, right?
Brad Klingenber...: Yes. NLP is an interesting thing for us too, because
again, the human in the loop nature of the problem,
it's quite difficult, for example, for even with cutting
edge NLP, to beat the stylist capacity to see a note
from a client that might be something like, "Hey, I'm
going on a vacation. Send me something for the
beach." To really understand that intent, it's hard to
beat the human capacity. At the same time, we can do
a lot of things with that text, algorithmically. And so
we work on, I think, learning from texts, both,
algorithmically, but then also through the role that our
stylists play in the process.
Kirill Eremenko: Yeah. And you can pick out some keywords, and help
stylists to sift through the text faster, I guess. I read
that you also have algorithms to even optimize how
your pickers go through your warehouse to pick up the
items that need to be sent. Is that true?
Brad Klingenber...: It is true. So we have a network of warehouses across
the country, that we use to fulfill our shipments to
clients. And within the walls of the warehouse, there's
a lot of opportunities for optimization, and again,
using the data we have, to help make things more
efficient. So we have a team focused on what we call
operations algorithms that among other things, thinks
about how to optimize the operation of our
warehouses.
Kirill Eremenko: Wow. That is very impressive. So basically you use
algorithms pretty much for everything. Is there
anything in the business you don't use algorithms for?
Brad Klingenber...: There might be a couple of small pockets here and
there, but I think actually one of the interesting stories
of the algorithm that we make at Stitch Fix is just the
breadth of engagement it has with the business. So
there's a lot of things we haven't even mentioned yet.
And actually, the fact that we have 125 people on the
team, more than anything, reflects the number of
different problems that we're working on. So if you drill
down into any particular problem, like our core
recommendation algorithms, you'll find there's less
than 10 people working on those problems. And so the
size of the team, more than anything, reflects that it's
really an enormous breadth of applications across the
business that the team is working on.
Kirill Eremenko: Yeah, it's incredible. How do you even keep your mind
on top of all these things as Chief Algorithms Officer?
To me, it sounds impossible to deal with all these
different algorithms in different parts of the business.
And then just keep up to date with what's going on.
Brad Klingenber...: I think there's couple of things that help with that. So
one is, I get to spend a lot of time with the team and to
understand the problems aren't working on. And it
certainly is a lot, and I certainly can't keep it all in my
short term memory all the time. I think beyond that
though, another special feature of the team is really
our strong culture of bottom-up innovation. And so the
development of better algorithms, or better ways to use
data to run the business doesn't rely on me having a
great idea, and then propagating that down the
organization to be implemented. But really, rather the
opposite. So some people who are closest to working
on the business problem really seeing opportunities,
exploring new models, and bringing that to life. And
so, my role of leader of the team is less somebody
directing things and a fine grain from above, and more
a like a gardener just trying to make sure conditions
are good so that all sorts of different things can grow.
Kirill Eremenko: I love that analogy. Analogy of a leader as a gardener.
That's perfect. Can you give us an example of a recent
bottom-up innovation that you think pop to mind?
Brad Klingenber...: Yeah, absolutely. I think many of the most significant
improvements we've made over the years have arisen
in this way. And I think even the Style Shuffle rating
experience that I mentioned, is generating such useful
data, was actually born out of just as a side project of
one of the data scientists on the team who decided to
try something, and prototype something. And it really
got traction, has become a real product here. But
that's something that is a great example of an
innovation that was not asked for in any executive
way.
Kirill Eremenko: Fantastic. I think this is a good time to make a small
plug for your hiring efforts with such a grand team,
and growing so fast. To me, it sounds like an amazing
place to work, especially when you can drive
innovation bottom-up, do experiments, choose what
area of business you want to work on, choose the type
of work, whether you want to do NLP, you want to do
traveling salesman problems, you want to do computer
vision. Pretty much all AI is there. You mentioned at
the start, before we started the podcast, you're hiring.
Can you tell us a bit about more about that? If
somebody is interested, what kind of roles you're
hiring for, and where can they apply?
Brad Klingenber...: Yeah, absolutely. So I would say in my time here, we
have been in a mere permanent state of hiring, and
that certainly continues to this day. The company and
team continue to grow. We seek people from a variety
of different backgrounds. And actually one of the really
fun things of working at Stitch Fix is just the variety of
backgrounds and skills that people bring. And this is
true within Algorithms, but in the broader company as
well. And so we have a team that has enormous variety
of different backgrounds, from the computer science
and statistics, to the sciences like physics, social
sciences, psychology, epidemiology. We really run in
the gamut. And so, we draw from a lot of different
backgrounds and find a lot of value in having that
diversity of training.
Brad Klingenber...: And in general, have a fairly strong preference for
people who are able to be generalists. And really, not
be too hyper specialized into any particular subset of
data science. And a lot of this has to do with the
business problems that we solve, and asking a very,
very open-ended question like, how should we manage
our inventory? How could we do a better job buying
things that our clients are going to like? Which are
just very open-ended problems that are amenable to
techniques from a variety of different fields.
Kirill Eremenko: Got you. And I'm looking at your careers page. You've
got algorithms platform engineer, data platform
engineer, data scientist, data acquisition algorithm,
data science and client recommendation, data science
and merchandise. Very, very cool. Most of these roles
are in San Francisco. Is your whole team based in San
Francisco?
Brad Klingenber...: It is, yes. So the algorithms team is in our
headquarters here in downtown San Francisco.
Kirill Eremenko: Okay, fantastic. All right. So if anyone's looking for a
cool job, this is your opportunity. Check out
stitchfix.com/careers/jobs. Okay. So the other thing I
wanted to ask you is, it's very evident that, and I think
I read this somewhere, that algorithms are a part of
the business model at Stitch Fix. Just by the way you
do business, this is a just like a bonus, not just like a
cherry on top of the cake. This is the cake of Stitch
Fix. What can other companies learn that? We've got
about 10% of our audience, executives and leaders
and business owners. What can other companies learn
from that? What takeaways would you recommend, or
you could share from having the algorithms as a part
of the business model?
Brad Klingenber...: So I think for Stitch Fix, for a very long time, and
much credit to Katrina Lake, our founder and CEO, for
foreseeing the opportunity. Data and algorithms have
been a key part of our differentiation strategy. And
really, to be able to really predict what people are going
to like, and to be able to give it to them in a fun and
convenient way. And going back to the framing of
making a bet on the recommendations, that's just
really important that we're good at winning that bet.
As a retailer, our strategy is not to carry the largest
selection of things, or to carry things at the very lowest
price, or to deliver them faster than other retailers, but
to find things that you love. And so it was really that
personalization, and ability to understand and learn
from you, is at the core of the business. And so, I think
to your question, we really have data and data science
at the heart of the business, and it's part of the core of
the strategy. It's not a second order optimization. But
really, a first order existential strategy for the
business.
Kirill Eremenko: Got you. And so, does that make it easier for the
business to grow and to understand its users? What's
the main advantage of having that data science as the
core?
Brad Klingenber...: I think in businesses where you first have the data, so
either the feedback loops exist, or there are ways to
learn from your customers or other stakeholders, you
are able to really take full advantage of everything that
you know. And even beyond using the algorithms to
make those decisions, having a culture that is very
data-oriented and empirical helps to make good
decisions about how to think about uncertainty, how
to think about measurement. And it really, in general,
I would say, just brings a more scientific approach to
driving a business forward. And so where the business
model supports it, I think, making investments in data
problems that are really of first order importance to
the businesses is the right way to go.
Kirill Eremenko: Got you. So increases certainty and predictability of
your business. What is the main challenge of having
algorithms as part of the business model, or having
data science at the core? We've seen the advantages.
They're very clear. It's obvious how now, you've grown
to this level, how much more streamlined things are,
and how different things are to a competitor that
wouldn't have that in a business structure. But what
would you say is the one or two biggest challenges
you've had along the way?
Brad Klingenber...: So we mentioned that we're hiring. I think data science
is just in a wonderful period of rapid growth. And
you're finding applications across many different
industries, which means that as a data scientist,
everybody has many different choices of what to work
on, and where to work. And so, one thing we're always
working on is bringing very talented people into the
company. And that's certainly a big part of what I
think about. I think another challenge of using data
and algorithms in this more active sense that we
discussed earlier, is trying to figure out what is the
best way to combine human judgment in algorithm
decision making? And when do you bet on an
algorithm, and when do you bet on human judgment?
And there's no fixed optimal answer to that.
Brad Klingenber...: And I think we're on something of a journey, of really
understanding what is that optimal boundary between
your algorithm making or human decision making?
And what is really, the right way for humans and
machines to work together in a way that enables folks
at such Stitch Fix to do things that they wouldn't
otherwise be able to do?
Kirill Eremenko: Let's talk a little more about that. This is a really cool
topic, which I identified before I started the podcast.
One of the interesting themes at Stitch Fix, the
humans plus machines. And if we think about it,
companies like... Again, taking the case of Netflix, I
imagine that all of that, the whole recommendation
process can be done entirely by a machine. And if we
take Amazon, especially there, with the volume of data
that's coming in, and number of transactions all the
time, again, that's most likely being done fully by
machines. So what is your view on the situation of
Stitch Fix? Do you think that the service that you
provide could ever be done entirely by machines, or
are humans an integral part of this whole setup in this
whole service that you have created?
Brad Klingenber...: So this is a question that I'm inevitably asked, anytime
I speak at public events, or give interviews. I think my
view is that the service would not be nearly as good
without the role of the human stylist in helping to pick
things for our clients. Nor is the work that we do on
building algorithms. It doesn't have the aim of actually
automating away the human component. I think the
way that we think about things is really how to
combine humans and machines in a way that that's
better than either alone. And so, the goal is really not
just automate things away, but to equip stylists with
capabilities they wouldn't otherwise have.
Brad Klingenber...: And I think, we talked about earlier, the possibilities
for interacting with clients where you can exchange
free form text, you can exchange image data, those are
all things you can work on algorithmically. And there's
really exciting advances, especially in recent years. But
it is quite difficult, still, to match the human part for
many of these things. And I think really, the human in
the loop is a really integral part of our strategy for
solving these problems. And I would expect it to stay
that way for a very long time.
Kirill Eremenko: Fantastic. And yeah, active human loop, as we
discussed at the start, very cool. How do your
stylists... What do your stylists say about this
experience for them? How do they find it, working with
artificial intelligence hand-in-hand?
Brad Klingenber...: It's certainly a core part of their experience. And I
think you can really, in some sense, think of the
process of picking items to send to clients as being
something of a dance between algorithms and stylists.
And so, we learn a lot from stylists. And so as I
mentioned, the feedback loops we use to improve our
algorithms are not just how clients respond to things,
and how they rate them, which of course is very
important, but also, the decisions our stylists are
making. And if we make a change to an algorithm,
does that make stylists have to search more for the
things that they want to send clients? Or does it make
them make decisions more easily? And really, taking a
view of styling as a combined system that involves
both algorithmic decision making and human decision
making.
Brad Klingenber...: And so we pay a lot of attention to stylists, or
interacting with algorithms. In both, quantitative ways,
so the stylists can tell us about the recommendations
that they're seeing, also in qualitative ways. So just
spending time with stylists and the leaders of the
styling organization to really understand their
experience with the tools, but also opportunities we
have to always be making things better.
Kirill Eremenko: Got you. And you mentioned that like three million
people, massive market, a huge market presence. We
spoke briefly before the podcast, that you're
expanding, or you've expanded to the UK. Tell us a bit
about that. Is that your first international expansion?
And what does it look like now, in terms of growing
your global presence?
Brad Klingenber...: It is. So, yeah. We were quite excited to expand into
the UK, and set up a business center. So that's our
first international business. The past few years have
been a story of expansion in the US market. So
starting as a styling service just for women and then
expanding to men's and kids and even different size
profiles, so were petite and plus size within women's,
for example. And so, international expansion is
similar, and a natural opportunity that we have. And
yeah, we're excited about growing our UK business.
Kirill Eremenko: Fantastic. And what does that mean for the algorithms
team? Are you going to stay in San Francisco? Or are
you going to start hiring in Europe as well?
Brad Klingenber...: So for now, everybody's still in San Francisco though
we do spend quite a bit of time with our colleagues in
London, both virtually and also by visits to London. I
always love to go when I get a chance. And yeah, it's
an exciting problem to work on. It's a different it's a
different market of clients that we have. Largely
different inventory. And when we launched of course,
there was a bit of a cold start problem. You have all
new inventory, all new clients, and starting a new
business. And of course, we have the benefit of
everything we know about our US business and all the
algorithmic approaches that work there. But it's really,
I think, as time goes on, the feedback starts to
accumulate and you can... There's a flywheel that that
kicks in, that helps drive things algorithmically.
Kirill Eremenko: That's very cool. And if you do, at some point, expand
your algorithms team to Europe as well, then you'll
have access to a whole new talent pool European
professionals and data center.
Brad Klingenber...: Absolutely. Yeah. I think that is a very compelling
consideration. And then thinking about hiring folks, as
they've very large pools of talent that exists. Of course,
not just in US but Europe as well.
Kirill Eremenko: Fantastic. And what does the future look for
personalization, using artificial intelligence? Because
that's predominantly the core part of the algorithm, or
output of the work with algorithms is personalization
and making it very user-specific, all these
recommendations. And that doesn't necessarily have
to be just applied in the space of fashion, design,
tailored clothing. It can be applied in lots of different
industries. From your experience, extrapolating just,
in general, onto personalization, what do you think the
future looks in terms of AI developments and data
science playing a huge part in this space?
Brad Klingenber...: So I think from a consumer perspective, in retail, but
in many other consumer services or products, we're in
an era of just overwhelming choice. And you see this
with retailers competing to carry the most broad
inventories. And Stitch Fix, I think, is a compelling
counter position to that. The service is really a
personalization and making it easy to find things that
you love. Not to show you 1,000 pairs of jeans that you
could choose from, but actually just to send you the
pair that fits just right, and that you love. And I think,
as you know, the value proposition there, while quite
well suited to fashion and apparel, need to be limited
there. And so I think, really, any industry where
there's this overwhelming burden of choice and
selection and having to do a lot of work to find things
that you love, is an opportunity for personalization,
which of course, there's a very successful one driven
by data to actually help people find things that they
love and relieve them of the burden of doing the
curation themselves.
Kirill Eremenko: I love it. I love that getting rid of that whole choice
paralysis problem. I think I've experienced that. You go
in. There's so many things to choose from. Before, and
I'll tell you, in my life, like even 20 or 15 years ago, you
go into a shop and there's only one thing that suits
your measurements, or what you wanted. You buy it,
then you're happy. Okay, it almost fit what I wanted.
But now you go into shop, there's at least 10 things
that would suit you, and they're all different colors,
shapes, sizes and so on. They're all perfect for you.
And then you buy one of them, and you walk out
feeling miserable because you are thinking of the
missed opportunity of buying all these other things,
and how much better they would have been. So it's
like you're bring us back to that feeling of satisfaction
from your purchases, because you're taking the choice
paralysis out of the equation.
Brad Klingenber...: Absolutely.
Kirill Eremenko: Yeah. It sounds very exciting. But then I've got an
interesting, more philosophical question then. Do you
think that... Is your work on algorithms, or these
kinds of personalization algorithms, will, or have the
potential of driving the world to a dystopian future
where we no longer can make choices for ourselves?
All we do is, "I'll wait for an algorithm to make a
choice. What are we going to eat? What are we going to
drink? What movie are we going to watch? Where are
we going to go for holiday? What are we going to
wear?" Basically, in that sense it's like taking the
human, not just out of the workforce, but rather
taking the human out of their life. And it sounds a
little bit scary, if you think about it that way. Maybe a
bit farfetched, but what are your thoughts on that?
Brad Klingenber...: Yeah, I wouldn't worry too much about that quite yet. I
think the general trends in so many settings is, again,
for this ever increasing selection, and just broader and
broader choices for people to make. And I think there's
certainly both risk and opportunity in narrowing what
you're exposed to, too dramatically. But I think the
prospect of that becoming so prevalent that it has
dystopian implications is probably pretty remote, in
my opinion.
Kirill Eremenko: Probably in the meantime, it's really great, what you're
doing. And another consideration is that it helps
reduce waste, right? When there's so much on offer,
things that don't get purchased and they go out of
fashion, they get thrown away. But when you can
personalize to the extent where you know your
customers really well, you can reduce the inventory
size. And that means the less waste, less carbon
emissions, less impact on the environment, things like
that. So even from that standpoint, this is the way to
go, I think.
Brad Klingenber...: Yeah. It's a wonderful aspect of this model, where
we're able to run a very efficient inventory, largely
because of how much we know about our clients. In
addition to this, the general structure of the model.
And something we haven't talked about too much
today is there's quite a lot of effort in the algorithms
team, in thinking about how to just make better
decisions about what to buy, and how to manage
inventory. And you can think about it in some sense
as this giant portfolio optimization problem. How do
we build an assortment of inventory that's going to
delight our millions of clients, and do so efficiently?
And the power of personalization here is really to
enable us to do a really good job, buying things we
know our clients are going to like, which, to your
point, in addition to being more efficient from a
business perspective, it's also more efficient and less
wasteful than other models.
Kirill Eremenko: Totally, totally. Well, it's been really cool dissecting all
of this and diving into some of the technical aspects
and also the business implications, ethical
implications. Really, really cool. But we're slowly
coming to the end of the podcast, and Brad I wanted to
ask you, do you have any final parting thoughts for
data scientists that are listening to this, who want to
progress their career, or want to maybe get into the
space of data science, who are by now, I hope, are very
excited about personalization and what AI can do in
this space? Any thoughts or wishes for them?
Brad Klingenber...: I think it's a tremendously exciting time to be in a
career in data science. And one I... Stitch Fix, and our
models are extremely interesting. But I think, even in a
larger sense, I think we're part of a transformation that
will play out over a lot of different industries as we
really bring data and algorithms and just scientific
approaches in general, to running and building
businesses. And so I think the opportunities will only
increase, and keep an open eye out for hard and
interesting problems to work on.
Kirill Eremenko: Fantastic. Fantastic. Thank you so much Brad for
coming to the show. Very exciting conversation. I really
enjoyed chatting with you. And before I let you go, can
you give us some places, online, where people can
follow you, connect with you, maybe follow Stitch Fix?
What are the best places to get more information and
stay in this loop?
Brad Klingenber...: Absolutely. I think more than anything, I'd recommend
checking out our blog, multi-threaded, which is a
technical blog of the company that includes the
algorithms tour that you mentioned, but then also a
pretty regular cadence of really interesting blog posts
about the way that we use algorithms at Stitch Fix.
Kirill Eremenko: Got you. So that's multithreaded.stitchfix.com. Very
exciting. And one more question I had for you before
we finish up, what is a book you can recommend to
our listeners for them to further their careers or just in
general their lives?
Brad Klingenber...: Yeah, great question. I think one really interesting area
of machine learning research these days is causal
inference. And this is something that there's really
active research there. And I think, for folks who are
interested in learning a bit about that, but might not
have encountered it in their training or experience,
there's a pretty general audience book called Mostly
Harmless Econometrics, that I think is a fun
introduction to the subject. And a little more broadly
for life, I've really enjoyed recently reading a book
called Origins by Lewis Dartnell, and which was about
the role of geology in shaping human history. And I
think it's a fun story of how we got to where we are
today.
Kirill Eremenko: Very cool. Have you read Sapiens by Yuval Noah
Harari?
Brad Klingenber...: I have. And it's in the same genre of large tellings of
the human story over thousands of years.
Kirill Eremenko: Yeah, it's really cool. And there's another one, Silk
Roads, Peter Frankopan. That's another really good
one.
Brad Klingenber...: Yup. One of my favorite genres.
Kirill Eremenko: Yeah. Very cool. Okay, thank you. So the
recommendations are Mostly Harmless Econometrics,
and Origins. On that note, Brad, once again, thank
you so much for coming on the show, spending some
of your time with us. I'm sure this is going to be very
inspiring to lots of data scientists out there.
Brad Klingenber...: Thank you. It's a pleasure.
Kirill Eremenko: So there you have it, everybody. I appreciate you being
on this show with us today, and hearing from Brad
Klingenberg from Stitch Fix. I hope you enjoyed this
episode as much as I did, and got value out of those
amazing insights that Brad was sharing with us. My
personal favorite part about our conversation today
was the sheer volume and diversity of applications of
algorithms at Stitch Fix. This is really cool to see how
a company is using algorithms in absolutely
everything they do, and at the same time, they are not
edging out humans. They are finding ways for
algorithms to coexist with humans and they're getting
the advantage of having both working hand-in-hand.
Kirill Eremenko: I'm really impressed at how Stitch Fix is, in my view,
leading the path of becoming an algorithms-driven
company. They're not just using it for one application
to set themselves aside from the competition. They're
using it across the board. They actually have an
algorithms team and the Chief Algorithms Officer. How
cool is that? Totally. Totally love this conversation, and
I'm sure you have your personal favorite takeaway
from today as well.
Kirill Eremenko: And as usual, you can find the show notes for this
episode at superdatascience.com/349. That's
superdatascience.com/349. There, you'll find the
transcript for this episode, as well as any materials we
mentioned, including the careers page for Stitch Fix. I
highly recommend checking out this company,
especially if you're in San Francisco. This could be a
fantastic place and sounds like a fantastic place to
work at. They're hiring. So check it out. It's data
science of all kinds in stores, whatever you like. I'm
sure you'll be able to find it at Stitch Fix.
Kirill Eremenko: Also in the show notes, you will find a link to the
algorithms tour that I mentioned. Also very cool read,
something to browse through. It will probably take you
more than a day to get through all of it. It's very well-
written, highly animated, very insightful. Highly
recommend checking it out. And finally, you'll also find
the blog of Stitch Fix that Brad mentioned. That could
be a very useful read for you as well. So lots of
materials there. The link is
superdatascience.com/349.
Kirill Eremenko: And one ask I have for you today, if you enjoyed this
episode, share it with somebody. Share it with
somebody who is interested in algorithms, who is
maybe in the space of personalization, or online
services, so they can get an idea of what's happening
in this space. If you know any data scientists, machine
learning engineers who are passionate about
algorithms, this could help them build their career
structure. And also if you know any business owners
who might be looking to build an algorithms-driven, or
at least data-driven company, Stitch Fix is a great
testament, a great example to strive for. And I'm sure
they would find it useful. Very easy to share. Just send
them the link, superdatascience.com/349. And on that
note, we're going to wrap up for today, and I look
forward to seeing you back here next time. Until then,
happy analyzing.