Azure Synapse Artifacts client library for Java - version 1.0.0-beta.15
Azure Synapse is a limitless analytics service that brings together enterprise data warehousing and Big Data analytics. It gives you the freedom to query data on your terms, using either serverless on-demand or provisioned resources—at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for immediate BI and machine learning needs.
The Azure Synapse Analytics development client library enables programmatically managing artifacts, offering methods to create, update, list, and delete pipelines, datasets, data flows, notebooks, Spark job definitions, SQL scripts, linked services and triggers.
Source code | API reference documentation | Product documentation | Samples
Getting started
Adding the package to your project
Maven dependency for the Azure Synapse Artifacts client library. Add it to your project's POM file.
<dependency>
<groupId>com.azure</groupId>
<artifactId>azure-analytics-synapse-artifacts</artifactId>
<version>1.0.0-beta.15</version>
</dependency>
Prerequisites
- Java Development Kit (JDK) with version 8 or above
- An Azure subscription.
- An existing Azure Synapse workspace. If you need to create an Azure Synapse workspace, you can use the Azure Portal or Azure CLI.
az synapse workspace create \ --name <your-workspace-name> \ --resource-group <your-resource-group-name> \ --storage-account <your-storage-account-name> \ --file-system <your-storage-file-system-name> \ --sql-admin-login-user <your-sql-admin-user-name> \ --sql-admin-login-password <your-sql-admin-user-password> \ --location <your-workspace-location>
Authenticate the client
In order to interact with the Azure Synapse service, you'll need to create an instance of the ArtifactsClient class. You would need a workspace endpoint and client secret credentials (client id, client secret, tenant id) to instantiate a client object using the default DefaultAzureCredential
examples shown in this document.
The DefaultAzureCredential
way of authentication by providing client secret credentials is being used in this getting started section but you can find more ways to authenticate with azure-identity.
Create/Get credentials
To create/get client secret credentials you can use the Azure Portal, Azure CLI or Azure Cloud Shell
Here is an Azure Cloud Shell snippet below to
Create a service principal and configure its access to Azure resources:
az ad sp create-for-rbac -n <your-application-name> --skip-assignment
Output:
{ "appId": "generated-app-ID", "displayName": "dummy-app-name", "name": "http://dummy-app-name", "password": "random-password", "tenant": "tenant-ID" }
Create Artifacts client
Once you've populated the AZURE_CLIENT_ID, AZURE_CLIENT_SECRET, and AZURE_TENANT_ID environment variables and replaced your-workspace-endpoint with the URI returned above, you can create artifact clients. For example, the following code creates PipelineClient:
import com.azure.identity.DefaultAzureCredentialBuilder;
import com.azure.analytics.synapse.artifacts.PipelineClient;
import com.azure.analytics.synapse.artifacts.ArtifactsClientBuilder;
PipelineClient client = new ArtifactsClientBuilder()
.endpoint("https://{YOUR_WORKSPACE_NAME}.dev.azuresynapse.net")
.httpLogOptions(new HttpLogOptions().setLogLevel(HttpLogDetailLevel.BODY_AND_HEADERS))
.credential(new DefaultAzureCredentialBuilder().build())
.buildPipelineClient();
NOTE: For using an asynchronous client use PipelineAsyncClient instead of ArtifactsClient and call
buildPipelineAsyncClient()
Key concepts
Pipeline Client
The pipline client performs the interactions with the Azure Synapse service for getting, setting, updating, deleting, and listing piplines. Asynchronous (PipelineAsyncClient) and synchronous (PipelineClient) clients exist in the SDK allowing for the selection of a client based on an application's use case.
Examples
The Azure.Analytics.Synapse.Artifacts package supports synchronous and asynchronous APIs. The following section covers some of the most common Azure Synapse Analytics artifacts related tasks:
Sync API
The following sections provide several code snippets covering some of the most common Azure Synapse Artifacts service tasks, including:
Pipeline examples
Create a pipeline
createOrUpdatePipeline
creates a pipeline.
String pipelineName = "MyPipeline" + new Random().nextInt(1000);
PipelineResource createdPipeline = client.createOrUpdatePipeline(pipelineName, new PipelineResource()
.setActivities(new ArrayList<>()));
System.out.printf("Created pipeline with id: %s\n", createdPipeline.getId());
Retrieve a pipeline
GetPipeline
retrieves a pipeline.
PipelineResource retrievedPipeline = client.getPipeline(pipelineName);
List pipelines
GetPipelinesByWorkspace
enumerates the pipeines in the Synapse workspace.
PagedIterable<PipelineResource> pipelines = client.getPipelinesByWorkspace();
for (PipelineResource p : pipelines) {
System.out.printf("Retrieved pipeline with id: %s\n", p.getId());
}
Delete a pipeline
DeletePipeline
deletes a pipeline.
client.deletePipeline(pipelineName);
Async API
The following sections provide several code snippets covering some of the most common asynchronous Azure Synapse Artifacts service tasks, including:
- Create a pipeline asynchronously
- Retrieve a pipeline asynchronously
- List pipelines asynchronously
- Delete a pipeline asynchronously
Note : You should add
System.in.read()
orThread.sleep()
after the function calls in the main class/thread to allow async functions/operations to execute and finish before the main application/thread exits.
Create a pipeline asynchronously
createOrUpdatePipeline
creates a pipeline.
String pipelineName = "MyPipeline" + new Random().nextInt(1000);
client.createOrUpdatePipeline(pipelineName, new PipelineResource().setActivities(new ArrayList<>()))
.subscribe(p -> System.out.printf("Created pipeline with id: %s\n", p.getId()));
Retrieve a pipeline asynchronously
getPipeline
retrieves a pipeline.
client.getPipeline(pipelineName).subscribe(pipelineResponse ->
System.out.printf("Retrieved pipeline with id: %s\n", pipelineResponse.getId()));
List pipelines asynchronously
getPipelinesByWorkspace
enumerates the pipelines in the Synapse workspace.
client.getPipelinesByWorkspace().subscribe(pipeline ->
System.out.printf("Retrieved pipeline with id: %s\n", pipeline.getId()));
Delete a pipeline asynchronously
deletePipeline
deletes a pipeline.
client.deletePipeline(pipelineName).block();
Troubleshooting
General
Azure Synapse Artifacts clients raise exceptions. For example, if you try to retrieve an artifact after it is deleted a 404
error is returned, indicating the resource was not found. In the following snippet, the error is handled gracefully by catching the exception and displaying additional information about the error.
try {
PipelineResource deletedPipeline = client.getPipeline(pipelineName);
} catch (ResourceNotFoundException e) {
System.out.println(e.getMessage());
}
Default HTTP client
All client libraries by default use the Netty HTTP client. Adding the above dependency will automatically configure the client library to use the Netty HTTP client. Configuring or changing the HTTP client is detailed in the HTTP clients wiki.
Default SSL library
All client libraries, by default, use the Tomcat-native Boring SSL library to enable native-level performance for SSL operations. The Boring SSL library is an Uber JAR containing native libraries for Linux / macOS / Windows, and provides better performance compared to the default SSL implementation within the JDK. For more information, including how to reduce the dependency size, refer to the performance tuning section of the wiki.
Next steps
Several Synapse Java SDK samples are available to you in the SDK's GitHub repository. These samples provide example code for additional scenarios commonly encountered while working with Azure Synapse Analytics.
Additional documentation
For more extensive documentation on Azure Synapse Analytics, see the API reference documentation.
Contributing
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.
When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.