Intro to SQL

By |

Ever want to know how to communicate with a database? You need to know SQL, a standard programming language for working with relational database management systems in data warehouses or just Microsoft Access. This workshop will cover the basic syntax of SQL. Material will focus mainly on how to query databases. A web-based tool will be used for the tutorial.

Intro to SQL

By |

Ever want to know how to communicate with a database? You need to know SQL, a standard programming language for working with relational database management systems in data warehouses or just Microsoft Access. This workshop will cover the basic syntax of SQL. Material will focus mainly on how to query databases. A web-based tool will be used for the tutorial.

Flux HPC Blog: Querying data with SparkSQL

By | Data, General Interest, HPC, News

SparkSQL is a way for people to use SQL-like language to query their data with ease while taking advantage of the speed of Spark, a fast, general engine for data processing that runs over Hadoop. I wanted to test this out on a dataset I found from Walmart with their stores’ weekly sales numbers. I put the csv into our cluster’s HDFS (in /var/walmart) making it accessible to all Flux Hadoop users.