How Do You Create an Ubuntu Linux VM in Azure Using PowerShell Azure?

Updated 1/12/19

Problem scenario
You want to create an Ubuntu Linux server with one vCPU and one GB of RAM in Azure using PowerShell.  You want to use Azure Resource Manager (an ARM template).  You do not want to use the Azure Portal for 95% of the work involved.  What do you do to automate the process (including the deployment of a resource group, a network security group, a subnet) so you can have a VM that is ready to use?

How Do You Create Servers Using .JSON Files and Resource Manager in Azure?

Problem scenario
You want to automate the creation of servers in Azure.  You want to try ARM templates to learn more about them.  How do you use .json files and Azure PowerShell to do this (without using the web UI)?

Solution
Prerequisites
i. You have a way of running Azure PowerShell commands.  If you need assistance with this, see this posting if your workstation is running Windows 7 or this posting if your workstation is running Windows 10. 

How Do You Install UiPath for Free to Try out RPA?

Problem scenario
You want to install UiPath Studio CE to test out RPA on a Windows 2016 Server.   How do you get a free version of UiPath (e.g., Studio CE)?

Solution
Prerequisite #1
You will have to agree to this agreement on UiPath’s website if you want to use the product.

Prerequisite #2
For this test, we recommend at a minimum 1 CPU at 2.3 Ghz and 2.25 GB of RAM. 

How Do You Resize a Server in Google Cloud Platform?

Problem scenario
You want to increase the amount of RAM that a VM instance has in GCP.  How do you do this?

Possible Solution #1
1.  Log into Google Cloud Platform
2.  In the “Cloud Engine” screen click on VM “instances.”
3.  You should see a list of VM instances.  Under the “Recommendation” tab you will see “Increase perf.”  Click on this. If you do not see “Increase perf”,

How Do You Get the Hadoop Yarn Web UI to Work?

Problem scenario
You want to get Hadoop’s web UI to work.  You have access to the back-end of a Linux server.  How do you get the front-end of Hadoop (or hdfs) to work?

Solution
1.  Deploy Hadoop.  See this link for directions for any type of Linux. 
2.  Start the Hadoop services.  For a single-node deployment of Hadoop use sudo find / -name start-dfs.sh to find it;

How Do You See the Storage Usage on a Datanode in an hdfs System?

Problem scenario
You want to see which datanodes are active underlying a given hdfs system.  You also want to know statistics about the storage usage.  If you are regularly adding data to your hdfs system, you want to stay below 70% utilization.  If you want your hdfs system to perform well but you are not regularly adding new files, you want to stay under 80%.  How do you find out about the storage usage of your datanodes that support your hdfs system?

How Do You Troubleshoot the Hadoop Error “Could not find or load main class jar”?

Problem scenario
You are trying to run a Hadoop operation.  You issue your “hadoop jar” or “hdfs jar” command.  You get one of these errors:

“Error: Could not find or load main class jar”

or this error

“Error: Could not find or load main class hadoop-streaming-2.8.1.jar”

What is wrong?

Solution
1.  Use “hadoop…” instead of “hdfs”.

2. 

What is MapReduce in Hadoop?

Problem scenario
You want to learn more about MapReduce.  You want to learn what it does so you can grasp Hadoop more thoroughly.  What is MapReduce?

Solution
MapReduce is a core process of Hadoop.  With multi-node deployments of Hadoop, MapReduce distributes data to different datanodes (servers that are controlled by a master server).  This data is retrievable thanks to MapReduce.  Conceptually there are two main components to MapReduce: mapper and reducer.