Migrating SQL Data into Azure Cosmos DB

By:   |   Updated: 2022-02-08   |   Comments   |   Related: > Azure Cosmos DB


Please do not scroll away - stay informed.
Dear Database Professional,

Did you know that MSSQLTips.com publishes new SQL Server content on a daily basis as well as offers free webinars and tutorials?

We know your day is hectic and you don't necessarily have time to research new topics and solutions every day, but we can keep you informed.

Take 30 seconds to register for our newsletter and look for free educational content to help you grow your career. >> REGISTER HERE <<

Thank you,
Greg Robidoux and Jeremy Kadlec (MSSQLTips.com Co-Founders)
Problem

The tutorial, Introduction to the Azure Cosmos DB, helped you get familiar with Microsoft Azure Cosmos DB as a scalable NoSQL database. We used Azure Cosmos DB for Free to create a new container, database, insert JSON documents, and query data stored in the container. In this tutorial we look at how to migrate SQL relational data into Azure Cosmos DB.

Solution

The Azure Cosmos DB Data Migration tool can import data from various sources into Cosmos DB. The supported formats are outlined below:

  • JSON
  • CSV
  • SQL
  • MongoDB
  • Azure Table Storage
  • DynamoDB
  • Azure Cosmos DB SQL API Collections

Prerequisites

The Azure Cosmos DB Data Migration tool requires Microsoft .Net framework 4.51 or higher to run the tool. Download the executable package from GitHub.

Azure Cosmos DB Data Migration tool

Extract the compressed data migration tool files and the folder container to access the following:

  • dt.exe: Command-line version of data migration tool
  • dtui.exe: Graphical user interface version of data migration tool
Azure Cosmos DB Data Migration tool

Migrate SQL Data into Azure Cosmos DB

First, you need SQL relational data that you want to migrate to Azure Cosmos DB. This tip uses the [AdventureWorks2019] database and runs the following T-SQL statement that retrieves data for migration.

SELECT CAST(businessentityid AS VARCHAR) AS Id, 
       NAME, 
       addresstype                AS [Address.AddressType], 
       addressline1               AS [Address.AddressLine1], 
       city                       AS [Address.Location.City], 
       stateprovincename          AS [Address.Location.StateProvinceName, 
       postalcode                 AS [Address.PostalCode], 
       countryregionname          AS [Address.CountryRegionName] 
FROM   sales.vstorewithaddresses 
WHERE  addresstype = 'Main Office'  

Note: If you are doing this with production data it would make sense to tune your SQL query to retrieve relevant columns and records for optimized migration.

The query returns 701 rows for data migration as shown below.

query results

We use the column alias such as [Address.AddressType] or [Address.Location.City]. The "." is a nesting operator to create the subdocuments. We will explore the generated JSON data later in this tutorial.

Double-click on dtui.exe to launch the Azure Cosmos DB Migration Tool. The Welcome page gives tool introductory information, an external link for documentation, and new feature voting.

Azure Cosmos DB Data Migration tool welcome

Click Next.

Source Information:

The source information page asks what type of data you want to import. It specifies JSON files, MongoDB, MongoDB export JSON file, SQL, CSV files, Azure Table, DynamoDB, HBase, and Azure Cosmos DB.

Azure Cosmos DB Data Migration tool source information

Choose SQL.

Then enter a standard SQL connection string like below with the appropriate values.

Data Source=myServerAddress;Inital Catalog=myDataBase;UserID=myUsername;Password=myPassword;

Specify your connection string with correct values and click Verify to see if the tool can connect to your SQL database. In the Enter Query section paste the query to fetch the data required for migration. You can also use a SQL file with the option "Select Query File".

The Nesting Separator is for modifying the document structure based on the specified operator. Specify "." as the nesting operator as shown below which we used above.

Azure Cosmos DB Data Migration tool source information

Click Next.

Target Information:

The target information page requires the following user inputs.

Azure Cosmos DB Data Migration tool target information

Export To:: You need to specify the export mode to the Azure Cosmos DB container using the sequential or bulk import method. The sequential method copies the records one by one. Therefore, it can migrate data to single or multiple partitions container. The bulk importer method uses a stored procedure to migrate data into single partition collections during the import operation. We will use Azure Cosmos DB – Sequential record import (partitioned collection) in this tip.

Connection String: The data migration tool requires Cosmos DB connection string in the following format.

AccountEndpoint=<CosmosDB Endpoint>;AccountKey=<CosmosDB Key>;Database=<CosmosDB Database>;

For the connection string (AccountEndpoint=<CosmosDB Endpoint>; AccountKey=<CosmosDB Key>), navigate to the Azure Cosmos DB account and copy the primary connection string from the key tab as shown below.

azure cosmos db keys

Append the database name on the right side of the primary connection string. Click Verify to validate the connection status for the Azure Cosmos DB account. As shown below, it successfully connected to the Azure Cosmos DB account.

Azure Cosmos DB Data Migration tool target information

Collection: Specify the container name in the collection field.

Azure Cosmos DB Data Migration tool target information

The target information field has additional configurations such as Partition Key, Collection Throughput, Id Field, and Advanced Options.

Advanced Options

The advanced options has the following configurations.

Azure Cosmos DB Data Migration tool target information
  • Number of parallel requests: The migration tool uses two parallel requests by default. You can increase the parallel requests in case the documents are small.
  • Disable automatic id generation: We can select this option to disable automatic ID generation. Usually, you should select it if the document has an ID field that uniquely defines it.
  • Number of retries on failure: If the data migration fails due to transient failures (network interruption), the number of retries can help to restart the process automatically. Its default value is 30.
  • Retry Interval specifies the wait type between retrying connection to Cosmos DB during network interruptions.
  • Connection mode: The connection mode supports DirectTcp, DirectHttps, and Gateway modes. By default, it uses DirectTcp mode for connection.
  • Enter Indexing policy: The data migration tool allows to specify indexing policy for the collections. It provides the following policy templates.
    • Default: is suitable for performing equality queries against strings or using ORDER BY, range or equalities for numbers.
    • Range: The range policy is suitable for using ORDER BY, range queries on numbers and strings.
Azure Cosmos DB Data Migration tool target information

We are not specifying an indexing policy, therefore the tool uses the default indexing policy.

Click Next.

Advanced:

You can specify an error log file, control the detail in the log file, and progress update interval as shown below.

Azure Cosmos DB Data Migration tool advanced

Summary:

We can review the migration summary and generate the resulting migration command.

Azure Cosmos DB Data Migration tool summary

Click on the View Command button to generate the commands if we do not want to run it in graphical mode.

Azure Cosmos DB Data Migration tool summary

Once we have verified the source and target configurations, click Import.

Results:

It starts the SQL data migration to Azure Cosmos DB and reports the elapsed time, transferred count, and failure information if any. The following screenshot states that it transferred 701 documents into Azure Cosmos DB without failure.

Azure Cosmos DB Data Migration tool results

Query Imported Data

Let’s use data explorer to query the imported data into Azure Cosmos DB.

Start a new query tab and it displays the default query that retrieves all records. As shown below, the SQL data is imported into JSON format into Cosmos DB.

data explorer

We can sort documents using an ORDER BY clause similar to the T-SQL statement.

data explorer

Wrapping Up

That's it. That was a simple use case of how to import SQL data into Azure Cosmos DB.

Next Steps
  • Implement an Azure Cosmos DB container and database for migrating a SQL Database into it.
  • Refer to the Cosmos DB documentation for further details.
  • Go through existing tips on Microsoft Azure.
  • Stay tuned for more articles on Azure Cosmos DB.





get scripts

next tip button



About the author
MSSQLTips author Rajendra Gupta Rajendra Gupta is a Consultant DBA with 14+ years of extensive experience in database administration including large critical OLAP, OLTP, Reporting and SharePoint databases.

View all my tips


Article Last Updated: 2022-02-08

Comments For This Article





download














get free sql tips
agree to terms