Resolution
-----------



1.    [Download and install](http://www.squirrelsql.org/#installation) SQuirrel SQL Client.


2.    [Connect to the master node using SSH](https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-connect-master-node-ssh.html).


3.    On the master node, run the following command to start Spark Thrift Server:





```plaintext
sudo /usr/lib/spark/sbin/start-thriftserver.sh
```



4.    Copy all **.jar** files from the **/usr/lib/spark/jars** directory on the master node to your local machine.


5.    Open SQuirrel SQL Client and create a new driver:  
For **Name**, enter **Spark JDBC Driver**.  
For **Example URL**, enter **jdbc:hive2://localhost:10001**.


6.    On the **Extra Class Path** tab, choose **Add**.


7.    In the dialog box, navigate to the directory where you copied the **.jar** files in step 4, and then select all the files.


8.    In the **Class Name** field, enter **org.apache.hive.jdbc.HiveDriver**, and then choose **OK**.


9.    On your local machine, set up an SSH tunnel using [local port forwarding](https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-ssh-tunnel-local.html):





```plaintext
ssh -o ServerAliveInterval=10 -i path-to-key-file -N -L 10001:localhost:10001 hadoop@master-public-dns-name
```



10.    To connect to the Spark Thrift Server, create a new alias in SQuirrel SQL Client:  
For **Name**, enter **Spark JDBC**.  
For **Driver**, choose **Spark JDBC Driver** (the driver that you created in step 5).  
For **URL**, enter **jdbc:hive2://localhost:10001**.  
For **Username**, enter **hadoop**.


11.    Run queries from SQuirrel SQL Client.
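
If the client can't connect, it can help to first confirm on the master node that Spark Thrift Server accepts JDBC connections. The following is a minimal sketch, not an official procedure. It assumes that `beeline` is available at `/usr/lib/spark/bin/beeline` on the master node (an assumed path; adjust it for your cluster), and the function names `thrift_jdbc_url` and `check_thrift_server` are hypothetical helpers introduced for illustration.

```shell
#!/usr/bin/env bash
# Sketch: confirm on the master node that Spark Thrift Server accepts JDBC connections.
# BEELINE is an assumed location; adjust it if beeline lives elsewhere on your cluster.
BEELINE="${BEELINE:-/usr/lib/spark/bin/beeline}"

# Build the JDBC URL used in the steps above (host and port are parameters).
thrift_jdbc_url() {
  printf 'jdbc:hive2://%s:%s\n' "$1" "$2"
}

# Run a trivial query as the hadoop user; a connection error here suggests
# that the server is not listening on port 10001.
check_thrift_server() {
  "$BEELINE" -u "$(thrift_jdbc_url localhost 10001)" -n hadoop -e 'SHOW DATABASES;'
}

if [ -x "$BEELINE" ]; then
  check_thrift_server
else
  echo "beeline not found at $BEELINE; set BEELINE to its location" >&2
fi
```

If the query lists databases on the master node but SQuirrel SQL Client still can't connect, the SSH tunnel from step 9 is the more likely place to look.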


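
Step 4 doesn't say how to copy the **.jar** files. One option is `scp`, sketched below under a few assumptions: `path-to-key-file` and `master-public-dns-name` are the same placeholders used in step 9, `spark-jars` is a hypothetical local directory name, and the `run` and `copy_spark_jars` helpers are introduced only for this example. By default the script only prints the commands; set `DRY_RUN=0` to run them.

```shell
#!/usr/bin/env bash
# Sketch of step 4: copy the Spark jars from the master node to the local machine.
# All three values are placeholders; replace them with your own.
KEY_FILE="${KEY_FILE:-path-to-key-file}"
MASTER_DNS="${MASTER_DNS:-master-public-dns-name}"
LOCAL_JAR_DIR="${LOCAL_JAR_DIR:-spark-jars}"

# Print each command; execute it only when DRY_RUN=0.
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-1}" = "0" ]; then
    "$@"
  fi
}

copy_spark_jars() {
  run mkdir -p "$LOCAL_JAR_DIR"
  # The remote glob is quoted so that the local shell does not expand it.
  run scp -i "$KEY_FILE" "hadoop@${MASTER_DNS}:/usr/lib/spark/jars/*.jar" "$LOCAL_JAR_DIR/"
}

copy_spark_jars
```

The directory that receives the files is the one to select in step 7 when you add entries on the **Extra Class Path** tab.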



---








I want to configure a Java Database Connectivity (JDBC) driver for Spark Thrift Server so that I can run SQL queries from a SQL client on my Amazon EMR cluster.

Set up a Spark SQL JDBC connection on Amazon EMR

How do I set up a Spark SQL JDBC connection on Amazon EMR?