Skip to main content

Scarlett: Coping with Skewed Content Popularity in MapReduce Clusters







To improve data availability and resilience MapReduce frame-
works use le systems that replicate data
uniformly. However,
analysis of job logs from a large production cluster shows
wide disparity in data popularity. Machines and racks storing
popular content become bottlenecks; thereby increasing the
completion times of jobs accessing this data even when there
are machines with spare cycles in the cluster. To address this
problem, we present
Scarlett, a system that replicates blocks
based on their popularity. By accurately predicting le popu-
larity and working within hard bounds on additional storage,
Scarlett causes minimal interference to running jobs. Trace
driven simulations and experiments in two popular MapRe-
duce frameworks (Hadoop and Dryad) show that
Scarlett ef-
fectively alleviates hotspots and can speed up jobs by . 



Popular posts from this blog

(26) Post | LinkedIn

(26) Post | LinkedIn : ► Trump was first compromised by the Russians back in the 80s. In 1984, the Russian Mafia began to use Trump real estate to launder money and it continued for decades. In 1987, the Soviet ambassador to the United Nations, Yuri Dubinin, arranged for Trump and his then-wife, Ivana, to enjoy an all-expense-paid trip to Moscow to consider possible business prospects. Only seven weeks after his trip, Trump ran full-page ads in the Boston Globe, the NYT and WaPO calling for, in effect, the dismantling of the postwar Western foreign policy alliance. The whole Trump/Russian connection started out as laundering money for the Russian mob through Trump's real estate, but evolved into something far bigger. ► In 1984, David Bogatin — a Russian mobster, convicted gasoline bootlegger, and close ally of Semion Mogilevich, a major Russian mob boss — met with Trump in Trump Tower right after it opened. Bogatin bought five condos from Trump at that meeting. Those condos were...