One or more of the following is happening:
1) An error message reports that there are 0 DataNodes in your Hadoop cluster.
2) There is 0 B configured as capacity (as shown from a “hdfs dfsadmin -report” command).
3) There is one fewer DataNode in your Hadoop cluster than you expect.
4) You run “hdfs dfsadmin -report | grep Hostname” and do not see a node whose DataNode service (visible with the jps command) is started and stopped from the NameNode by the corresponding start-dfs.sh and stop-dfs.sh script runs.
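A quick way to check symptoms 1), 2), and 4) is to count how many DataNodes appear in the report. The sketch below parses a hypothetical sample of “hdfs dfsadmin -report” output (the hostnames and capacity shown are made up); on a real cluster, pipe the live command output instead, e.g. `hdfs dfsadmin -report | grep -c '^Hostname:'`.

```shell
# Hypothetical sample of "hdfs dfsadmin -report" output for illustration.
sample_report='Configured Capacity: 98268229632 (91.52 GB)
Live datanodes (2):
Hostname: worker1
Hostname: worker2'

# Count the nodes that actually reported in.
live_count=$(printf '%s\n' "$sample_report" | grep -c '^Hostname:')
echo "DataNodes reporting in: $live_count"
```

If the count is lower than the number of DataNodes you expect, check the missing node with `jps` to see whether its DataNode process is running at all.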
You have a multi-node cluster of Hadoop. You want to add a new data node. What do you do?
1. a) Log into the server that will become the new DataNode. Run the remaining sub-steps of step 1 on this server.
b) Install Hadoop on the new DataNode. If you do not know how, see this posting.
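Once Hadoop is installed on the new server, the NameNode must be told about it by listing its hostname in the workers file (named “slaves” on Hadoop 2.x). This sketch uses an example directory and example hostnames; substitute your real configuration directory (e.g., /usr/local/hadoop/etc/hadoop) and the new node’s hostname.

```shell
# Example configuration directory and hostnames for illustration only.
HADOOP_CONF=./demo-conf
mkdir -p "$HADOOP_CONF"
printf 'worker1\n' > "$HADOOP_CONF/workers"

NEW_NODE=worker2
# Append the new DataNode's hostname only if it is not already listed,
# so rerunning this step is harmless.
grep -qx "$NEW_NODE" "$HADOOP_CONF/workers" || echo "$NEW_NODE" >> "$HADOOP_CONF/workers"
cat "$HADOOP_CONF/workers"
```

After updating the file, restarting HDFS (stop-dfs.sh, then start-dfs.sh) from the NameNode picks up the new worker.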
What are the different acronym stacks in I.T.?
Many open source combinations of technologies are in wide use. Their acronyms, referred to as “full stacks” or simply “stacks,” appear in articles and job descriptions. A full stack is a bundle of software (including an OS) that, when properly configured, can create a complete and functional product.
Updated on 1/22/19
You want to install open source Hadoop. You may want a single-node or multi-node deployment with CentOS/RedHat/Fedora, Debian/Ubuntu, and/or SUSE Linux distributions. You want to have most of it scripted and have the same script work on any variety of Linux. How do you install Hadoop quickly with a script that works on almost any type of Linux?
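A script that targets CentOS/RedHat/Fedora, Debian/Ubuntu, and SUSE alike needs to detect which package manager is present before installing prerequisites. This is a sketch of that detection step; the function name is illustrative, not part of any Hadoop installer.

```shell
# Detect the system's package manager so one installer script can run on
# Debian/Ubuntu (apt-get), RedHat/CentOS/Fedora (yum or dnf), or SUSE (zypper).
detect_pkg_mgr() {
  for mgr in apt-get dnf yum zypper; do
    if command -v "$mgr" >/dev/null 2>&1; then
      echo "$mgr"
      return 0
    fi
  done
  echo "unknown"
}

PKG_MGR=$(detect_pkg_mgr)
echo "Detected package manager: $PKG_MGR"
```

The rest of the script can then branch on `$PKG_MGR` to install Java and other dependencies with the right commands for that distribution.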
What is Apache Parquet?
Apache Parquet is a columnar data representation/manipulation tool for the Hadoop ecosystem. Data in a given column is largely uniform (e.g., a long string of characters, a single character, or an integer): it repeats a specific type and format of data, whereas two cells in the same row may hold very dissimilar types of data.
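The row-versus-column distinction can be sketched with plain files (this is a conceptual illustration only, not the actual Parquet on-disk format): a row store interleaves every field of a record, while a column store keeps each field contiguous, so a query over one column reads far less data. The file names below are illustrative.

```shell
mkdir -p layout-demo
# Row-oriented layout: one line per record, all fields interleaved.
printf '1,alice,42\n2,bob,17\n3,carol,99\n' > layout-demo/rows.csv
# Column-oriented layout: one file per column, values of one type together.
printf '1\n2\n3\n'            > layout-demo/col_id
printf 'alice\nbob\ncarol\n'  > layout-demo/col_name
printf '42\n17\n99\n'         > layout-demo/col_score
# Summing the "score" column touches only col_score, not the other fields.
total=$(awk '{s+=$1} END {print s}' layout-demo/col_score)
echo "sum(score) = $total"
```

Parquet builds on this idea, adding per-column encoding and compression, which work well precisely because values within a column are so uniform.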
You want to install Apache Parquet on the Hadoop NameNode. What do you do?
This assumes that you have installed Hadoop. For directions, see this posting.
Run these commands:
sudo su -
apt-get -y install python-pip
pip install thriftpy
sudo apt-get -y install libsnappy-dev thrift-compiler
pip install python-snappy
curl https://pypi.python.org/packages/74/b5/bc459aab0566fc3cf3397467922c37411ab6e3361bab9e0ca165e1089ce8/parquet-1.2.tar.gz#md5=05aacec0620ac63ecd7dd77bf7fb9fee >
What is a “data swamp”?
A data swamp is best defined as a severely degraded data lake. The term connotes poor governance and negligent management that have caused a data lake to gradually lose its value. A data swamp is a data lake that was once useful but, through negligent utilization, can no longer be used even by highly talented analytics professionals.
You want to use Maven’s Apache Parquet plugin with Hadoop. How do you use these Apache technologies together?
1. Install HDFS. See this link if you are using Ubuntu. See this link if you are using a RedHat distribution of Linux. If you have more than one server and want a multi-node cluster of Hadoop, see this link for directions on how to deploy and configure it.
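To pull Parquet into a Maven-built project, the usual approach is declaring a Parquet artifact as a dependency in the project’s pom.xml. The snippet below is an example only; the version shown is illustrative, so check Maven Central for the latest release of org.apache.parquet:parquet-avro.

```xml
<!-- Example dependency for a project pom.xml; the version is illustrative. -->
<dependency>
  <groupId>org.apache.parquet</groupId>
  <artifactId>parquet-avro</artifactId>
  <version>1.12.3</version>
</dependency>
```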
You want to learn more about low latency programming for working at a hedge fund company. Where do you begin?
An Amazon search for “low latency programming” brings up books that could also be very useful for other high-paying I.T. jobs in big data, such as a book on Apache Thrift or Building a Columnar Database on RAMCloud.
The Hadoop NameNode service won’t start. You run start-dfs.sh and it starts the SecondaryNameNode and the DataNode, but not the NameNode.
Assumption: This solution only works if you are running start-dfs.sh as a sudo user or as root itself.
Verify that you can ssh as root to localhost. Run this command twice without exiting from the first session: