Nasuni Data Service & Azure AI Search Integration

Prev Next

Overview

This guide explains how to connect Azure AI Search to Nasuni Data Service (NDS) for Azure so you can index file content and metadata stored in Nasuni volumes. It focuses on NDS-specific configuration (endpoints, sample JSON, and index-schema guidance) while pointing you to Microsoft documentation for general Azure AI Search concepts (Data Sources, Indexers, Skillsets).

For general Azure AI Search concepts such as creating Data Sources, Indexers, and Skillsets, refer to Microsoft’s official documentation.

Prerequisites

Before beginning, ensure you have the following:

  • An NDS for Azure deployment connected to one or more Nasuni volumes, and the connection information.

  • An Azure AI Search service in your Azure subscription.

  • Familiarity with key Azure AI Search components (Data Source, Indexer, Indexes, and so forth).

Cost Awareness

Important: Indexing large datasets can incur significant Azure charges. Scope your first index to a small folder or test volume until you understand cost drivers.

Refer to Nasuni’s Azure AI Search Cost Awareness Tips for additional information and best practices.

Configure the Connection

1 - Collect Inputs

Parameter

Description

Example

<your-data-source-name>

A unique name for the data source

<your-nds-connection-string>

Your NDS connection string, including the SAS token.

BlobEndpoint=https://nds.uniquevalue.azurecontainerapps.io;SharedAccessSignature=sv=2025-05-05&ss=b&srt=s&st=2025-06-19T20%3A59%3A09Z&se=2030-12-31T05%3A00%3A00Z&sp=rl&sig=D4fyxagfi4Q6wnPrXEmBc5Umo%3D;

<your-nds-container>

The container name (Nasuni Volume GUID) to access.

5753b527-103c-4671-8f70-aee172b54-4

<your-container-query> (optional)

Sub-directory within the container/volume to index.
We recommend using this when testing your implementation, targeting a small subfolder to minimize indexing time and cost.

Accounting/North America/

2 - Create the Data Source

  1. In the Azure Portal, navigate to your Search Service and open the Data Sources section.
    Microsoft Azure interface displaying data sources and options for managing data connections.

  2. Click Add data source > Add data source (JSON).

  3. Paste the JSON below, replacing placeholders with your values.

{
"name": "<your-data-source-name>",
"type": "azureblob",
"credentials": {
"connectionString": "<your-nds-connection-string>"
},
"container": {
"name": "<your-nds-container>",
"query": "<your-container-query>"
}
}

Example:

{
"name": "nasuni-nds-connection",
"type": "azureblob",
"credentials": {
"connectionString": "BlobEndpoint=https://nds.uniquevalue.azurecontainerapps.io;SharedAccessSignature=sv=2025-05-05&ss=b&srt=s&st=2025-06-19T20%3A59%3A09Z&se=2030-12-31T05%3A00%3A00Z&sp=rl&sig=D4fyxagfi4Q6wnPrXEmBc5Umo%3D;"
},
"container": {
"name": "5753b527-103c-4671-8f70-aee172b54-4",
"query": "Accounting/North America/"
}
}
  1. Click Create to finish.

3 (Optional) - Define a Skillset for AI Enrichment

Azure AI Search Skillsets allow you to enrich your indexed data by applying AI capabilities. This includes key phrase extraction, entity recognition, image analysis, and language detection. The enrichment occurs before data is written to the index.

If you want to extend your indexing pipeline with these capabilities, you can create a Skillset after defining your Data Source.

Skillsets rely on other Azure services (such as Azure Cognitive Services) and may require additional configuration, permissions, and Azure costs. This section provides a starting point with a sample JSON template to help you explore this functionality.

Note: While we’re providing this example as a helpful guide, we recommend consulting Microsoft’s documentation and your Azure team for detailed configuration of Skillsets to match your specific use case.

Here’s an example Skillset definition that uses built-in AI skills for text enrichment:

{
  "name": "nds-skillset",
  "description": "Extract key phrases and detect language for NDS data",
  "skills": [
    {
      "@odata.type": "#Microsoft.Skills.Text.KeyPhraseExtractionSkill",
      "name": "key-phrases",
      "description": "Extract key phrases from content",
      "context": "/document",
      "inputs": [
        {
          "name": "text",
          "source": "/document/content"
        }
      ],
      "outputs": [
        {
          "name": "keyPhrases",
          "targetName": "keyPhrases"
        }
      ]
    },
    {
      "@odata.type": "#Microsoft.Skills.Text.LanguageDetectionSkill",
      "name": "language-detection",
      "description": "Detect language of the content",
      "context": "/document",
      "inputs": [
        {
          "name": "text",
          "source": "/document/content"
        }
      ],
      "outputs": [
        {
          "name": "languageName",
          "targetName": "language"
        }
      ]
    }
  ]
}

To use a Skillset:

  1. Define it in Azure AI Search (via the portal or JSON API).

  2. Reference the Skillset name in your Indexer configuration.

  3. Review Microsoft’s documentation for additional skill types and guidance on integrating with Cognitive Services:

    1. Azure AI Search Skillsets Overview

    2. Built-in Skills Reference

4 - Create the Index

  1. Navigate to Indexes, and click Add index > Add index.

  2. Define your index schema by specifying fields. We recommend starting with the following fields:

Field name

Type

Retrievable

Filterable
Sortable

Facetable

Searchable

Description

id 🔑

String

Yes

No

No

Unique identifier for each document.

content

String

Yes

No

Yes

The text extracted from file content.

metadata_storage_name

String

Yes

No

Yes

Name of the file with its file type extension.

metadata_storage_path

String

Yes

No

Yes

The complete path to the indexed file in URL format, beginning with your NDS endpoint.

Creating an index in Microsoft Azure with various field settings and options.

Additional fields can be added based on your indexing needs. Refer to the Azure documentation for additional details about content metadata properties. If you connect to your NDS endpoint using a tool like Azure Storage Explorer, you can view the metadata of a file to see the metadata presented by NDS that is available for indexing.

  1. Click Create.

5 - Create the Indexer

  1. Navigate to Indexers, and click Add indexer > Add indexer.

  2. Name the indexer, and select the Datasource and Index you previously created. Click Save.

  3. The indexer automatically begins an initial run.

6 - Validate the Index

  1. Allow the initial Indexer run to complete.
    Execution history of an Azure Indexer showing successful run details and statistics.

  2. Navigate to Indexes and select your index. Search for a known phrase or file name to confirm documents were ingested successfully (for example, a phrase from a document that has been indexed).
    Azure Search Explorer displaying document details and metadata storage information.