How Do You Install Apache Parquet?

Problem scenario
You want to install Apache Parquet on the Hadoop namenode.  What do you do?

This assumes that you have installed Hadoop.  For directions, see this posting.

Run these commands:

sudo su -
apt-get -y install pip
pip install thriftpy
pip install snappy

sudo apt-get -y install libsnappy-dev thrift-compiler

curl > /tmp/parquet-1.2.tar.gz
sudo cp /tmp/parquet-1.2.tar.gz /opt/
cd /opt
sudo tar -xvf parquet-1.2.tar.gz
cd parquet-1.2
sudo python build
sudo python install

Leave a comment

Your email address will not be published. Required fields are marked *