Azure Search
Azure Search is a search-as-a-service solution that allows developers to incorporate a sophisticated search experience into web and mobile applications without having to worry about the complexities of full-text search and without having to deploy, maintain, or manage any infrastructure.
-
Clone entire Azure Search instance, just like I can clone App Service.
It would be extremely handy if I could clone an entire Azure Search instance to the same or a new Resource Group, in the same or a new location, using the same or another Azure Subscription, just as I can clone an Azure App Service wherever I want.
Use case: if an issue is reproducible with a production instance of Azure Search, I want to clone it into a separate sandbox and take it from there. As of today, I have to create a brand new Azure Search instance, clone the SQL Database, and reindex to get the same data.
3 votes -
Allow custom defined Field Mapping Function
Currently, when defining indexers, one can specify field mappings from source to target fields. Only a limited set of functions, such as extracting a token at a position, is supported. It would be desirable to support a Func-style syntax in which each row is passed to the func, which then decides the target field value based on developer-defined logic.
E.g., we have a Cosmos DB source and an Azure Table Storage data source. The partition key for the Cosmos DB collection is a combination of tenantId:studentId for efficient reads/writes across partitions based on our data. The Azure Table Storage…
3 votes -
Table Storage Indexing of multiple tables at scale
I believe that a good pattern for Azure Table Storage is to enable fast deletes by simply dropping tables. I've implemented a multi-tenant model for my data using a table per tenant, and would like to aggregate search data across tables.
The current recommendation is to create an indexer per table. This means the number of tenants I support (which I'm hoping to be in the thousands) is limited to the number of indexers I can create, which is a very low number.
The latest table storage SDK has methods to asynchronously list tables in a segmented fashion using continuation…
14 votes -
Provide warmup procedure
When we scale out Search replicas, we see a latency peak right after the scale-out. It results in throttled queries, so I want to reduce it when we apply a scheduled replica change.
It would be good to provide "predefined warm-up query sets" that run on an after-scale-out event. How about providing a text box whose GET queries are executed line by line, multiple times?
Note that I'm using Azure Automation with this script to change the replica count on a schedule: https://gallery.technet.microsoft.com/scriptcenter/azure-search-change-c0b49c4c
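A rough sketch of what such a warm-up step might look like if done client-side after a scale-out. This is only an illustration of the idea: `run_warmup` and `send_query` are hypothetical names, and `send_query` stands in for whatever actually issues the GET request against the service.

```python
def run_warmup(queries_text, send_query, repeat=3):
    """Send every non-empty line of `queries_text` as a query, `repeat` times over.

    `send_query` is a caller-supplied callable (hypothetical); in a real setup
    it would issue an HTTP GET for the query against the search service.
    """
    queries = [line.strip() for line in queries_text.splitlines() if line.strip()]
    sent = 0
    for _ in range(repeat):
        for query in queries:
            send_query(query)
            sent += 1
    return sent


# Stand-in sender that only records what would have been requested.
recorded = []
warmup_text = "search=hotel\n\nsearch=airport\n"
total_sent = run_warmup(warmup_text, recorded.append, repeat=2)
```

Passing the sender in as a callable keeps the sketch independent of any particular HTTP client or SDK.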
2 votes -
Support for indexing multi-line JSON files
Support for indexing multi-line JSON files (https://en.wikipedia.org/wiki/JSON_Streaming). Currently the blob indexer extracts an entire JSON file as one document.
It would also be nice to pair this with gzip support so that it can index *.json.gz files.
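To make the request concrete, here is a minimal sketch of the behavior being asked for: treating each line of a line-delimited JSON payload as its own document, optionally after gzip decompression. The function name `iter_json_documents` and the sample data are assumptions for illustration, not part of any Azure Search API.

```python
import gzip
import io
import json


def iter_json_documents(raw: bytes, gzipped: bool = False):
    """Yield one parsed document per non-empty line of a line-delimited JSON payload."""
    if gzipped:
        raw = gzip.decompress(raw)
    for line in io.StringIO(raw.decode("utf-8")):
        line = line.strip()
        if line:
            yield json.loads(line)


# Hypothetical blob content: two JSON objects, one per line.
sample = b'{"id": "1", "title": "first"}\n{"id": "2", "title": "second"}\n'
docs = list(iter_json_documents(sample))
docs_gz = list(iter_json_documents(gzip.compress(sample), gzipped=True))
```

Until something like this is built in, a preprocessing step along these lines could split the file and push the resulting documents to the index directly.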
4 votes -
SAS support for query tokens
With blobs/containers it's possible to create a SAS token and time-limit it. It would be a great benefit if Azure Search allowed query keys based on the same concept.
6 votes -
Cover Azure Search in the HIPAA BAA
Please add Azure Search to the HIPAA BAA so we can use it in healthcare software.
9 votes -
Indexer execution status to App insights
Scenario: Thanks to the power of Azure, I have set up an indexer which continuously updates my index from an Azure SQL database. Now I want to know whether there were any errors while processing a document and why it failed to update the index (the error message). This information is critical for new documents and for any document that is modified in the prod database. Note: Different teams do not have access to the Azure Search service portal.
Current state: We already have indexer execution status but this information needs to be processed by another process by polling.
Ask: If this information…
18 votes -
Proximity Search with exact order support
The Lucene query syntax for proximity search allows us to find 2 terms within X words of each other.
A search for "hotel airport"~5 could return documents with "Find a hotel airport" or "hotel near airport" or "going to airport? check out this hotel".
It would be a big benefit to our clients if we could have one mode for proximity search where the order of the words doesn't matter, as well as a different, more restrictive mode where proximity search only finds results with the words in the explicit order given.
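The difference between the two requested modes can be illustrated with a simplified word-gap check. This is a sketch only: it counts the words between the two terms and is not Lucene's actual slop computation, and `within_proximity` and its `ordered` flag are hypothetical names.

```python
def within_proximity(text, first, second, max_gap, ordered=False):
    """Return True if `first` and `second` occur with at most `max_gap` words between them.

    With ordered=True, `first` must appear before `second`.
    """
    words = [w.strip('.,?!"').lower() for w in text.split()]
    first_positions = [i for i, w in enumerate(words) if w == first]
    second_positions = [i for i, w in enumerate(words) if w == second]
    for i in first_positions:
        for j in second_positions:
            if i == j:
                continue
            if ordered and j < i:
                continue
            if abs(j - i) - 1 <= max_gap:
                return True
    return False


reversed_text = "going to airport? check out this hotel"
unordered_match = within_proximity(reversed_text, "hotel", "airport", 5)
ordered_match = within_proximity(reversed_text, "hotel", "airport", 5, ordered=True)
```

In this sketch the reversed sentence matches only in the unordered mode, which is the distinction the request is asking to control.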
1 vote -
Configurable matching strategy for more natural searches.
You can see full details here:
http://stackoverflow.com/questions/43505244/how-to-make-searches-more-natural/43509264#43509264
The idea is to have something like:
EnableFuzzy, Enable PartialMatching on the SearchParameters to avoid constructing weird search strings.
3 votes -
Provide ability of collection Type field to be searched against individual values separately
For a document with multiple values in a collection-type field, Azure Search gives it a lower score (so it ranks lower or may not appear in the results at all) even though the search query has matched ONE of the values in the list COMPLETELY.
This is because Azure Search currently treats all the values in the collection as a single string by concatenating them.
My requirement is that if the search query (for that field) matches with any of the values in the collection, it should consider it…
26 votes -
Provide option to facet on all facetable fields in a Search request
Some applications can have a large number of facets and it is inconvenient to have to specify them all explicitly in the Search API, as reported here: http://stackoverflow.com/questions/28542665/azure-search-what-if-i-have-a-lot-of-facets
If there were an option like facet=* that returned facets on all facetable fields, that would be easier for such cases.
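Until an option like that exists, one possible client-side workaround is to read the index definition and emit one `facet` parameter per facetable field. The sketch below assumes the index definition is available as a dict with a `fields` list whose entries carry `name` and `facetable`; the `hotels` sample and the function name `facet_query_string` are made up for illustration, and a real request would also need the API version and an API key.

```python
from urllib.parse import urlencode


def facet_query_string(index_definition, search_text="*"):
    """Build a query string with one `facet` parameter per facetable field."""
    facetable = [
        field["name"]
        for field in index_definition["fields"]
        if field.get("facetable")
    ]
    params = [("search", search_text)] + [("facet", name) for name in facetable]
    return urlencode(params)


# Hypothetical index definition, trimmed to the properties used here.
index_definition = {
    "name": "hotels",
    "fields": [
        {"name": "id", "type": "Edm.String", "key": True, "facetable": False},
        {"name": "category", "type": "Edm.String", "facetable": True},
        {"name": "rating", "type": "Edm.Int32", "facetable": True},
        {"name": "description", "type": "Edm.String", "facetable": False},
    ],
}
query = facet_query_string(index_definition)
```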
11 votes -
Use an XSD schema to populate the index fields
We have large schemas that define individual aspects of healthcare data. We would like to use these schemas to define the fields rather than entering most of them manually. For this to work, we obviously need support for crawling XML data (similar to what you have for JSON). XML provides a good structure and has many industry-standard schemas that we can leverage.
3 votes -
DocumentDB indexer should be able to index MongoDb collections
If I create a MongoDB collection backed by DocumentDB, I should be able to use the DocumentDB indexer to ingest it into Azure Search.
4 votes -
Add machine learned ranking models
Similar to Solr's (small) capabilities
https://cwiki.apache.org/confluence/display/solr/Learning+To+Rank
15 votes -
distinct values in search result
Consider the situation where two documents have the same values for some properties. I want to select on those fields, count the matching documents as one (distinct), and return that distinct document.
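As an illustration of the desired result, here is a client-side sketch that collapses results sharing the same values for a chosen set of fields. It only shows the intended semantics under the assumption that results arrive as dicts with hashable field values; `distinct_by` and the sample documents are hypothetical, and doing this after the fact does not fix paging or counts the way a server-side feature would.

```python
def distinct_by(results, fields):
    """Keep the first document for each distinct combination of `fields`."""
    seen = set()
    unique = []
    for doc in results:
        key = tuple(doc.get(field) for field in fields)
        if key not in seen:
            seen.add(key)
            unique.append(doc)
    return unique


# Hypothetical search results: the first two share brand and model.
results = [
    {"id": "1", "brand": "Contoso", "model": "X"},
    {"id": "2", "brand": "Contoso", "model": "X"},
    {"id": "3", "brand": "Contoso", "model": "Y"},
]
distinct_results = distinct_by(results, ["brand", "model"])
```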
34 votes -
allow indexers to call DocumentDB stored procedures
We have difficulty in transferring documents where we require multiple maps of our data into the flat search index. There is a limit of two user defined functions (which we use to do this) and there doesn't appear to be a way to call a stored procedure through the query language. Having the search indexers support DocumentDB stored procedures gives us far more power and flexibility to map the data correctly. At the moment we have had to revert back to writing our own code to do this.
15 votes -
Allow create, update, delete of Data Sources, Indexes, and Indexers with Service through ARM templates
Currently you can only create a service instance through an ARM template; you then need to make REST calls to create, update, or delete Data Sources, Indexes, and Indexers. This makes using Azure Search a challenge in solutions that are released in a Continuous Delivery model. If I want to use Release Management in VSTS to automate deployments, I need to write additional PowerShell scripts to call Azure Cmdlets to get the Primary Key for the service, then call REST APIs through PowerShell, and read in JSON files to send as part of the request. It's a very complicated process…
26 votes -
Having a WebJob in the process that executes before indexing, for when we want to replay events from the event store and create the DB
1 vote
