Snowflake Snowpark API: An Introduction

Soumak Das
5 min read · Sep 18, 2023

The Snowpark library offers a user-friendly toolkit for querying and processing data at scale within the Snowflake platform. With Snowpark, you can develop applications that handle data within Snowflake without the need to move data to an external system where your application code resides. This allows you to efficiently process data at scale, leveraging the power of the elastic and serverless Snowflake engine.
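As a rough illustration of processing data without moving it, here is a minimal Snowpark Python sketch. The `ORDERS` table, its `REGION` and `AMOUNT` columns, and the `SNOWFLAKE_*` environment variable names are assumptions made up for this example, not part of any real account. The DataFrame operations only describe the query; the work runs inside Snowflake when an action such as `show()` is called.

```python
import os


def connection_parameters(env=None):
    """Build a Session config dict from environment variables.

    The SNOWFLAKE_* variable names are a hypothetical convention for this sketch.
    """
    env = os.environ if env is None else env
    keys = ["account", "user", "password", "warehouse", "database", "schema"]
    # Include only the settings that are actually present in the environment.
    return {
        key: env[f"SNOWFLAKE_{key.upper()}"]
        for key in keys
        if f"SNOWFLAKE_{key.upper()}" in env
    }


def large_orders_by_region(session, min_amount=100):
    """Describe an aggregation over a hypothetical ORDERS table.

    This only builds a lazy DataFrame; no data is read or transferred here.
    """
    # Imported lazily so the helper above works without the library installed.
    from snowflake.snowpark.functions import col, sum as sum_

    return (
        session.table("ORDERS")
        .filter(col("AMOUNT") > min_amount)
        .group_by("REGION")
        .agg(sum_(col("AMOUNT")).alias("TOTAL_AMOUNT"))
    )


def main():
    """Connect, run the query inside Snowflake, and print the result rows."""
    from snowflake.snowpark import Session

    session = Session.builder.configs(connection_parameters()).create()
    try:
        # show() is an action: the query executes in Snowflake at this point.
        large_orders_by_region(session).show()
    finally:
        session.close()
```

Calling `main()` requires the `snowflake-snowpark-python` package and valid credentials; only the small aggregated result is returned to the client.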

Snowflake currently provides Snowpark libraries for three programming languages: Java, Python, and Scala.

Hands-on tutorials for getting started include:

  1. Machine Learning with Snowpark Python
  2. Data Engineering Pipelines with Snowpark Python
  3. Getting Started With Snowpark for Python and Streamlit
  4. Building an Image Recognition App in Snowflake using Snowpark Python, PyTorch, Streamlit, and OpenAI
  5. Getting Started With Snowpark Scala

You can use the Snowpark libraries with any of the languages listed below:

  • Java
  • Python
  • Scala

Key Features

Snowpark offers several distinctive features that set it apart from other client libraries:

Benefits Compared to the Spark Connector

Developing with Snowpark, as opposed to using the Snowflake Connector for Spark, provides the following advantages:

  • Support for interacting with data within Snowflake using…
