resource sharing beyond boundaries - apache myriad

Post on 18-Feb-2017

322 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Resource Sharing Beyond Boundaries   Mohit Soni Santosh Marella

Adam Bordelon Anoop Dawar Ben HindmanBrandon Gulla Danese Cooper Darin JohnsonJim Klucar Kannan Rajah Ken SipeLuciano Resende Meghdoot Bhattacharya Paul ReedRenan DelValle Ruth Harris Shingo OmuraSwapnil Daingade Ted Dunning Will OchandarenaYuliya Feldman Zhongyue Luo

AgendaWhat's up with Datacenters these days?Apache Mesos vs. Apache Hadoop/YARN?Why would you want/need both?Resource Sharing with Apache Myriad

What's running on your datacenter?Tier 1 servicesTier 2 servicesHigh Priority BatchBest Effort, backfill

RequirementsProgramming models based on resources, not machinesCustom resource typesCustom scheduling algorithms:Fast vs. careful/slowLightweight executors, fast task launch timeMulti-tenancy, utilization, strong isolation

Hadoop and MoreSupport Hadoop/BigData ecosystemSupport arbitrary (legacy) processes/containersConnect Big Data to non-Hadoop apps, share data, resources

Mesos from 10,000 feetOpen Source Apache project Cluster Resource Manager Scalable to 10,000s of nodes Fault-tolerant, no SPOF Multi-tenancy, Resource Isolation Improved resource utilization

Mesos is more thanYet Another Resource Negotiator

Long-running services; real-time jobs Native Docker; cgroups for years;

Isolate cpu/mem/disk/net/other Distributed systems SDK;

~200 loc for a new app Core written in C++ for performance,

Apps in any language

Why two resource managers?

Static Partitioning sucksHadoop teams fine with isolated clusters, but Ops team unhappy; slow to provisionResource silos, no elasticityWant to run Hadoop on the same infrastructure, without interrupting Tier-1 servicesWant multi-tenancy, resource sharing/isolation

Introducing Myriad

Myriad OverviewMesos Framework for Apache YARNMesos manages DC, YARN manages HadoopCoarse and fine grained resource sharing

Resource Sharing

Demo

Myriad improves Mesos Tighter integration with Hadoop frameworks

like HBase, Hive, PigBorrow resources from Hadoop when traffic spikes for tier-1 servicesBackfill unused resource capacity with best-effort Hadoop jobs

No Mesos code changes necessary

Myriad improves Hadoop Elastic scaling Fault-tolerant: Maintain NM capacity Share resources with other workloads,

improve resource utilization High SLA hadoop jobs unaffected No YARN/Hadoop code changes

Other FeaturesRM failover/discovery using Marathon/Mesos-DNSDistribution of hadoop binariesWeb InterfaceMyriad scheduler HA, task reconciliation(in progress)Ability to launch Job History Server(in progress)Your favorite feature here!

Learn More!https://github.com/mesos/myriad

dev@myriad.incubator.apache.orgMYRIAD JIRA

Apache Myriad Incubator ProposalApache Myriad Incubator Status Page

top related