HBASE + HUE
THE UI FOR
APACHE HADOOP
Abraham Elmahrek

LA HBase User Group - Dec 12, 2013

WHAT
IS HUE?
WEB INTERFACE FOR MAKING
HADOOP EASIER TO USE
Suite of apps for each Hadoop component,
like Hive, Pig, Impala, Oozie, Solr, Sqoop2, HBase...

VIEW FROM
30K FEET

Hadoop

Web Server

You and even
that friend
that uses IE9 ;)

[Diagram: Hue app icons, including Hive, Pig, Impala, Oozie, Sqoop, Search, HBase Browser, ZooKeeper Browser, File Browser, Job Browser and Job Designer]
ECOSYSTEM

TARGET
OF HUE
GETTING STARTED WITH HADOOP
BEING PRODUCTIVE EXPLORING
DIFFERENT ANGLES OF THE PLATFORM
LET ANY USER FOCUS ON BIG DATA
PROCESSING
BEING COMPATIBLE WITH ANY HADOOP
VERSION (0.20/1.2.0/2.3.0)

OPEN SOURCE
2560+ COMMITS
31 CONTRIBUTORS
477 STARS
157 FORKS

github.com/cloudera/hue

THE CORE
TEAM PLAYERS
ABRAHAM ELMAHREK
ROMAIN RIGAUX
ENRICO BERTI
CHANG BEER

team.gethue.com

GROWING
COMMUNITY
[Chart: Posts and Topics per month, Nov through Nov; axis scales 0-500 and 0-70]

hue-user@

HISTORY
HUE 1

Desktop-like in a browser, did its job but pretty slow, memory leaks
and not very IE friendly but definitely advanced for its time
(2009-2010).

HISTORY
HUE 2

The first flat structure port, with Twitter Bootstrap all over the place.

HISTORY
HUE 2.5

New apps and an improved UX, adding nice new functionality
like autocomplete and drag & drop.

HISTORY
HUE 3 ALPHA

Proposed design, didn’t make it.

HISTORY
HUE 3

Transition to the new UI, major improvements and new apps.

HISTORY
HUE 3.5

Where we are now: new UI, several new apps, the most user-friendly
features to date.

BIG DATA MEANS HARDER SEARCH PROBLEMS.
HBASE APPLICATION DEVELOPMENT WOES.
UNDERSTANDING DATA BETTER.

HBASE BROWSER
PROBLEMS
LACK OF ACCESSIBILITY

LACK OF FAMILIARITY

USABILITY CHALLENGES

HBASE BROWSER
DESIGN CHALLENGES
PROBLEM

HBase is a key-value store, not a traditional RDB
SOLUTION

Collapse Sparse Data across HBase Tables

HBASE BROWSER
FLEXIBLE SEARCH BAR
PROBLEM

Need high-level tool for complex data fetching & analysis
SOLUTION

Simple custom query language
Supports HBase filter language
Supports selection & Copy + Paste, gracefully degrades in IE
Autocomplete Help Menu

Searchbar Syntax Breakdown
Row Key, Prefix Scan, Scan Length
Column/Family Filters
Thrift Filterstring
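
To make the row key, prefix scan and scan length pieces concrete, here is a minimal sketch of how one search bar term might be split apart. The grammar is an assumption for illustration only: a trailing `*` marks a prefix scan, a trailing `+N` sets the scan length, and terms are comma separated. `parse_term`, `parse_query` and `RowQuery` are hypothetical names, not Hue's code, and column filters and filter strings are left out.

```python
import re
from dataclasses import dataclass
from typing import List

# Hypothetical, simplified grammar for ONE search bar term (an assumption,
# not Hue's actual parser):  <row_key>[*][ +<scan_length>]
TERM_RE = re.compile(r"^(?P<key>[^*+\s]+)(?P<prefix>\*)?(?:\s*\+(?P<length>\d+))?$")


@dataclass
class RowQuery:
    row_key: str
    prefix_scan: bool   # True when the key ended with '*'
    scan_length: int    # number of rows to fetch, defaults to 1


def parse_term(term: str) -> RowQuery:
    """Split a single term into row key, prefix flag and scan length."""
    match = TERM_RE.match(term.strip())
    if match is None:
        raise ValueError(f"unsupported search term: {term!r}")
    length = int(match.group("length")) if match.group("length") else 1
    return RowQuery(match.group("key"), match.group("prefix") is not None, length)


def parse_query(query: str) -> List[RowQuery]:
    """Parse comma-separated terms, e.g. 'user_1, user_2* +10'."""
    return [parse_term(part) for part in query.split(",") if part.strip()]


if __name__ == "__main__":
    for row_query in parse_query("user_1, user_2* +10"):
        print(row_query)
```

This sketch does not handle row keys that contain commas, spaces, `*` or `+`; a real parser would need quoting or escaping rules for those.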

HBASE BROWSER
DOES IT SCALE?
PROBLEM

Performance on Millions of Columns
SOLUTION

Lazy loading & Truncation using Thrift FilterString
Stream raw data cached MEM > DOM elements dynamically generated
b64encode binary data to preserve during ASCII dump
Detect schema on preview reading byte headers
Capped, Lazy Loaded & Bound to DOM via MVVM Pattern

IT DOES.
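
Two of the points above lend themselves to a short sketch: capping the columns fetched per row with a filter string, and base64-encoding cell values so binary data survives a text (JSON) dump. `ColumnPaginationFilter(limit, offset)` is a filter from HBase's filter language; the helper names, the default cap of 100, and the shape of the `cells` dictionary are assumptions for illustration, not Hue's code.

```python
import base64
import json


def truncation_filter(limit: int = 100, offset: int = 0) -> str:
    """Build a filter string that returns at most `limit` columns per row."""
    if limit <= 0 or offset < 0:
        raise ValueError("limit must be positive and offset non-negative")
    return f"ColumnPaginationFilter({limit}, {offset})"


def encode_cells(cells: dict) -> str:
    """Serialize {column_name: raw_bytes} as JSON with base64-encoded values."""
    safe = {col: base64.b64encode(raw).decode("ascii") for col, raw in cells.items()}
    return json.dumps(safe, sort_keys=True)


def decode_cells(payload: str) -> dict:
    """Inverse of encode_cells: recover the original bytes per column."""
    return {col: base64.b64decode(val) for col, val in json.loads(payload).items()}


if __name__ == "__main__":
    # Hypothetical row with one binary cell and one text cell.
    row = {"cf:img": b"\x00\xff\x10", "cf:name": b"hue"}
    print(truncation_filter(50, 100))
    print(encode_cells(row))
```

Loading the next page of columns then amounts to repeating the scan with a larger offset, which is one way to get the lazy loading described on the slide.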

ZOOKEEPER BROWSER

TECHNICAL DETAILS

Communicate with Contrib. REST API
View Node Stats
View and Edit Node Data
Add and Remove ZNodes

QUERY HBASE
WITH HIVE

HIVE EDITOR
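
Querying HBase from the Hive editor usually starts with a Hive table mapped onto an HBase table through Hive's `HBaseStorageHandler`. The sketch below generates that DDL from Python; the table and column names are hypothetical, and requiring the row key as the first column is a simplification of this sketch rather than a Hive rule.

```python
from typing import List, Tuple

STORAGE_HANDLER = "org.apache.hadoop.hive.hbase.HBaseStorageHandler"


def hbase_table_ddl(hive_table: str, hbase_table: str,
                    columns: List[Tuple[str, str, str]]) -> str:
    """Build a CREATE EXTERNAL TABLE statement for an existing HBase table.

    columns: (hive_column, hive_type, hbase_mapping) tuples, where the
    mapping is ':key' for the row key or 'family:qualifier' otherwise.
    """
    if not columns or columns[0][2] != ":key":
        raise ValueError("this sketch expects the row key (':key') first")
    col_defs = ", ".join(f"{name} {hive_type}" for name, hive_type, _ in columns)
    # The mapping string must not contain spaces between entries.
    mapping = ",".join(hbase_col for _, _, hbase_col in columns)
    return (
        f"CREATE EXTERNAL TABLE {hive_table} ({col_defs})\n"
        f"STORED BY '{STORAGE_HANDLER}'\n"
        f'WITH SERDEPROPERTIES ("hbase.columns.mapping" = "{mapping}")\n'
        f'TBLPROPERTIES ("hbase.table.name" = "{hbase_table}");'
    )


if __name__ == "__main__":
    # Hypothetical HBase table 'users' with one column family 'info'.
    print(hbase_table_ddl("hbase_users", "users", [
        ("rowkey", "STRING", ":key"),
        ("name", "STRING", "info:name"),
    ]))
```

Once the statement has been run in the Hive editor, the mapped table can be queried with ordinary `SELECT` statements.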

QUERY HBASE
WITH PIG

PIG EDITOR

INDEX HBASE
AND SEARCH IT
HUE

LILY INDEXER

CLOUDERA SEARCH

DEMO
TIME

MISSED
SOMETHING?

learn.gethue.com

WHAT’S MISSING?

SECURITY

Kerberos
SSL
HBASE-8818 Thrift V2 Please!

STARGATE?

Use of Thrift interface over REST interface for HBase Browser

ANYTHING ELSE?

You tell us!

LINKS
WEBSITE

http://gethue.com
BLOG

http://blog.gethue.com
TWITTER

@gethue
USER GROUP

hue-user@

GET HUE
CLOUDERA’S CDH

Stable and highly tested releases perfectly integrated with the Hadoop ecosystem, automagically configured by Cloudera Manager.
TARBALL

Try in advance the latest and greatest but you’ll have to configure everything on your own.
CLOUDERA’S DEMO VM

Get to play with Hue and various Hadoop components in 5 minutes. It’s a self contained CDH environment ready to use.

HORTONWORKS*

In HDP there’s an old forked version of Hue 2.3.
MAPR*

Newer version than HDP, close to the original 2.5 minus apps like HBase, Impala, Sqoop, Search.
HP CLOUD*

The newest addition, ships Hue 3.0 through the GreenButton products.

* YOUR MILEAGE MAY VARY.

TEAM@
GETHUE.COM

WHAT ARE YOUR USE
CASES?

INTERESTED IN
CONTRIBUTING?

WANNA SAY HELLO?

WHICH COMPONENTS DO
YOU USE?

WHAT WOULD YOU LIKE TO
SEE IN HUE?

DO YOU WANT A TAILOR
MADE TEAM RETREAT?

THANKS.

www.gethue.com
