I routinely run Impala queries on a remote machine and want the results to save to a CSV file on that machine.
I recently realized that I should setup an Impala config file on that machine to configure the connection to the Impala cluster and the output file format.
So I created a text file at
~/.impalarc with the following settings:
[impala] impalad=CLUSTER-ADDRESS:PORT output_delimiter=, verbose=true ignore_query_failure=false default_db=prd write_delimited=true print_header=true
Now when I run a query with
impala-shell, I don’t have to specify the address of the cluster, or the various flags required to get a CSV with a header.
Other impala-shell config options are specified here