This is a training about Impala for developers covering the full stack of technical features, architecture and performance tuning. Impala supports analysis of large datasets stored in HDFS and compatible file systems, providing an SQL-like language. Other features of Impala include:
- Indexing to provide acceleration
-
Different storage types such as plain text, RCFile, HBase, ORC, and others.
-
Metadata storage in an RDBMS
-
Operating on compressed data stored into the Hadoop ecosystem
-
SQL-like queries