How Yelp (mostly) shut down its own data centers and moved to AWS

Back in 2013, Yelp was a nine-year-old company built on a set of internal systems. It was coming to the realization that running its own data centers might not be the most efficient way to run a business that was continuing to scale rapidly. At the same time, the company understood that the tech world had changed dramatically since its 2004 launch, and that it needed to move its underlying technology to a more modern approach.

That's a lot to take on in one bite, but it wasn't something that happened willy-nilly or overnight, says Jason Fennell, SVP of engineering at Yelp. The vast majority of the company's data was being processed in a massive Python repository that was getting bigger all the time, and the conversation about shifting to a microservices architecture began in 2012.

The company was also running the massive Yelp application inside its own data centers, and as it grew it was increasingly limited by the long lead times required to procure new hardware and bring it online. Yelp saw that this was unsustainable over the long term and began transforming from a huge monolithic application running on-premises to one built on microservices running in the cloud. It was quite a journey.

The data center conundrum

Fennell described the classic scenario of a company that could benefit from a shift to the cloud. Yelp had a small operations team dedicated to setting up new machines. When engineering anticipated a new resource requirement, it had to give the operations team enough lead time to order new servers and get them up and running. That was hardly the most efficient way to deal with a resource problem, and it was one the cloud could solve easily.

"We kept running into a bottleneck, I was running a chunk of the search team [at the time] and I had to project capacity out to 6-9 months. Then it would take a few months to order machines and another few months to set them up," Fennell explained. He emphasized that the team charged with getting these machines going was working hard, but there were too few people and too many demands and something had to give.

"We were on this cusp. We could have scaled up that team dramatically and gotten [better] at building data centers and buying servers and doing that really fast, but we were hearing a lot of AWS and the advantages there," Fennell explained.

To the cloud!

When Yelp surveyed the cloud market in 2013, AWS was the clear technological leader, which meant moving some part of its operations to EC2. Unfortunately, that exposed a new problem: how to manage this new infrastructure in the cloud. This was before the notion of cloud-native computing even existed; there was no Kubernetes. Sure, Google was operating in a cloud-native fashion in-house, but that was not really an option for most companies without a huge team of engineers.
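
To make the contrast with hardware procurement concrete, here is a minimal sketch, not Yelp's actual tooling, of what provisioning looks like once capacity is an API call rather than a purchase order. It uses boto3, the current AWS SDK for Python (the 2013-era equivalent was the older boto library); the AMI ID, instance type and tag values are placeholders.

```python
# Hypothetical illustration: launching one EC2 instance on demand.
# The AMI ID, instance type and tag are placeholders, not Yelp's values.
import boto3

ec2 = boto3.client("ec2", region_name="us-west-2")

response = ec2.run_instances(
    ImageId="ami-0123456789abcdef0",   # placeholder machine image
    InstanceType="m5.large",
    MinCount=1,
    MaxCount=1,
    TagSpecifications=[{
        "ResourceType": "instance",
        "Tags": [{"Key": "service", "Value": "search"}],  # example tag only
    }],
)

# The instance exists within minutes instead of after months of lead time.
print(response["Instances"][0]["InstanceId"])
```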

Yelp needed to explore new ways of managing operations in a hybrid cloud environment where some of the applications and data lived in the cloud and some lived in their data center. It was not an easy problem to solve in 2013 and Yelp had to be creative to make it work.

That meant keeping one foot in the public cloud and the other in a private data center. One tool that helped ease the transition was AWS Direct Connect, which AWS had introduced in 2011 and which enabled Yelp to connect directly from its data center to the cloud.

Laying the groundwork

Around this time, as Yelp was figuring out how AWS worked, another revolutionary technological change was occurring: Docker emerged and began mainstreaming the notion of containerization. "That’s another thing that’s been revolutionary. We could suddenly decouple the context of the running program from the machine it’s running on. Docker gives you this container, and it’s much lighter weight than virtualization and running full operating systems on a machine," Fennell explained.
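
As a rough illustration of that decoupling (not Yelp's code), the Docker SDK for Python can start the same container image on any host that runs a Docker daemon, regardless of what else is installed on that machine; the image and command here are arbitrary examples.

```python
# Hypothetical sketch: run a container from a public base image on whatever
# machine happens to be executing this script.
import docker

client = docker.from_env()  # talks to the local Docker daemon

output = client.containers.run(
    image="python:3.8-slim",                  # public base image, not a Yelp service
    command=["python", "-c", "print('same container, any machine')"],
    remove=True,                              # clean up the container after it exits
)
print(output.decode())
```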

Another development was the emergence of Mesos, an open-source data center operating system that offered a way to treat the data center as a single pool of resources, a notion Yelp could apply wherever its data and applications lived. Mesos also came with a container orchestration tool called Marathon, in the days before Kubernetes emerged as the popular way of dealing with the same issue.

"We liked Mesos as a resource allocation framework. It abstracted away the fleet of machines. Mesos abstracts many machines and controls programs across them. Marathon holds guarantees about what containers are running where. We could stitch it all together into this clear opinionated interface," he said.

Pulling it all together

While all this was happening, Yelp began exploring how to move to the cloud and take a Platform as a Service approach to the software layer. The problem was that, at the time, there wasn't really a viable way to do this. In the buy-versus-build decision-making that goes on in large transformations like this one, Yelp felt it had little choice but to build that platform layer itself.

In late 2013, the team began to pull together the idea of building this platform on top of Mesos and Docker, giving it the name PaaSTA, an internal joke that stood for Platform as a Service, Totally Awesome. It became known simply as Pasta.

Photo: David Silverman/Getty Images

The project had the ambitious goal of making Yelp's infrastructure work as a single fabric, in a cloud-native fashion, before almost anyone outside of Google was using that term. Pasta developed gradually, with the first developer piece coming online in August 2014 and the first production service following that December. The company open-sourced the technology the following year.

"Pasta gave us the interface between the applications and development teams. Operations had to make sure Pasta is up and running, while Development was responsible for implementing containers that implemented the interface," Fennell said.

Moving deeper into the public cloud

While Yelp was busy building these internal systems, AWS wasn't sitting still. It was also improving its offerings with new instance types, new functionality and better APIs and tooling. Fennell reports this helped immensely as Yelp began a more complete move to the cloud.

He said there were a couple of tipping points as Yelp moved more and more of the application to AWS, eventually including the master database. This happened in more recent years, as the team better understood how to use Pasta to control processes wherever they lived. What's more, he said, adopting other AWS services became possible thanks to tighter integration between the in-house data centers and AWS.

Photo: erhui1979/Getty Images

The first tipping point came around 2016, when all new services were configured for the cloud. He said the team got much better at managing applications and infrastructure in AWS, and its thinking shifted from how to migrate to AWS to how to operate and manage it.

Perhaps the biggest step in this years-long transformation came last summer, when Yelp moved its master database from its own data center to AWS. "This was the last thing we needed to move over. Otherwise it’s clean up. As of 2018, we are serving zero production traffic through physical data centers," he said. While the company still has two data centers, it is getting to the point where they hold only the minimum hardware required to run the network backbone.

Fennell said that before all of this was in place, getting a service up and running took two weeks to a month; now it takes just a couple of minutes. He says any loss of control from moving to the cloud has been easily offset by the convenience of using cloud infrastructure. "We get to focus on the things where we add value," he said, and that's the goal of every company.