Introduction To Azure Cognitive Search Flashcards
Azure cognitive search:
Searching for information online has never been easier full-stop however it’s still a challenge to find information from documents that aren’t in a search index.
For example Everyday People deal with unstructured typed image-based or handwritten documents. Often people must manually read through these documents to extract and record their insights in order to persist the found data. Now we have Solutions that can automate information extraction. More information at the bottom
Knowledge mining:
Knowledge mining is the term used to describe Solutions that involves extracting information from large volumes of often unstructured data.
One of these knowledge mining Solutions is as a cognitive search a cloud search service that has tools for building using managed indexes.
The end exes can be used for internal use only or to enable searchable content on public-facing internet assets.
Importantly as a cognative search can utilise the built-in AI capabilities of azure cognitive services such as image processing content extraction and natural language processing to perform knowledge meaning of documents.
The product ay-ay-ay capabilities make it possible to index previously untouchable documents and to extract and surface insights from large amounts of data quickly.
Azure cognitive search:
As a cognitive search provides the infrastructure and tools to create search Solutions that extract data from various structured semi-structured and unstructured documents.
As your cognitive search results can only contain your data which can include text in word or extracted from images or new entities and key phrases detection through text analytics
It’s a platform-as-a-service paas solution.
Microsoft managers the infrastructure and availability allowing your organisation to benefit without the need to purchase or managed dedicated hardware resources.
As your cognitive search features:
As you’re a cognitive search exist to complement existing technologies and provides a programmable search engine built on Apache Lucian and open software library.
It’s a highly-available platform offering 99.9% uptime SLA available for cloud and on-premises assets.
Azure cognitive search comes with the following features:
Data from any Source Code long as you’re cognitive search access starter from any source provider jsonformat with auto crawling support for selected data sources in Azure.
Full text search and analysis: azure cognitive search offers full text search capabilities supporting both query and full lucene query syntax.
Ai-powered search Caroline azria cognitive such as cognitive AI capabilities built-in for image and text analysis for all content from your content.
Multilingual call on azure cognitive services office domestic and Alice’s 456 languages to intelligently handle phonetic matching or language-specific linguistics. Natural language processors available in Azure cognitive search are also used by being an office. Continued below
Do you have enabled Caroline Adria cognitive search supports jio search filtering based on proximity to a physical location.
Configurable user experience scaling azure cognitive search has several features to improve the user experience including auto complete auto suggest pagni pagination and hit highlighting
Identify elements of a cert solution Caroline you line a typical azure cognitive services ocean starts with a data source that contains the data artifacts you want to search.
This could be hierarchy hierarchy of folders and files in Azure storage or text in a database such as as a SQL database to azure cosmos db.
The data format that cognitive search supports is jason.
Regardless of where your daughter originates if you can provide it as a dress and document the search in the search engine can index it.
If your data resides in supported data source you can use an indexer to automate data ingestion including jsonserialization of source data in data formats will stop and index it connects to a data source who realises the data and passes to the first search engine for indexing. Does indexes support change detection which makes data refresh a simple exercise
Besides automating data ingestion indexes also support AI in richmond.
You can attach a skill set that applies a sequence of AI skills to enrich the data and making it more suitable for stop a comprehensive set of balls and skills based on cognitive services API can help you derive newfields-for example by recognising entities Intex translating text evaluating sent to mentor predicting appropriate captions for images for stop optionally enriched content can also be sent to a knowledge store which stores output from an AI enrichment pipeline in tables and blobs in as your storage for independent analysis or downstream processing
Whether you write application code that pushes data to an index or use an index that automates data ingestion and adds enrichment-the Fields containing your content are perished in an index which can be searched by client applications.
The Fields are used for searching filtering and sorting to generate a set of results that can be displayed or otherwise used by the client application.
Use a skill set to define and enrichment pipeline:
And AI enrichment refers to embedded image and natural language processing in a pipeline that extracts text and information from plant that can’t otherwise be indexed for full text search.
Ai processing is achieved by adding and combining skills and her skill-set. As skillset defines the operations that extract and enrich data to make it searchable. These are skills can either be built in skills such as text translation or optical character recognition OCR or custom skills that you provide.
Building skills:
Building skills are based on free trade models from Microsoft which means you can’t train the model using your own training data.
Skills that call the cognitive resources API have a dependency on the services and are billed at the cognitive service pay as you go price when you attach a resourceful stop other skills are made by azure cognitive search all our utility skills that are available at no charge
Building skills fall into these categories:
Natural language processing skills with the skills and structured text is mapped as sociable and filter of all fields in an index.
Some examples include:
Keyphrase extraction: uses a pre-trained model to detect important phrases based on term placement linguistic rules proximity to other terms and how unusual the term is within the source tata.
Text translation skill: uses a Preacher and model translate the input text into various languages for normalisation or localisation use cases.
Image processing skills Quran creates tax representations of an image content making it’s searchable using the query capabilities of azure cognitive search.
Some examples include:
Image analysis skill colon uses an image addiction algorithm to identify the content of an image and generate a text description.
Optical character recognition skill Caroline allows you to extract printed or handwritten text from images such as photos of street signs and products as well as from documents-invoices bills financial reports articles and more.
Understanding indexes: And other cognitive search index can be thought of as a container of searchable documents. Conceptually you can think of an index as a table and each row in the table represents a document. Tables of columns and the columns can be thought of as equivalent to the fields and a document.
Columns have data types just as Fields do on the documents.
Index schema: In Asia cognitive search and index is a persistent collection of Jason documents and other content used to enable search functionality for stop the documents within an index can be thought of as rows in a table each document is a single unit of searchable data in the index.
The index includes a definition of the structure of the data in these documents called its schema.
And example of an index schema with a i extracted Fields key phrases and image tags
Displayed as a document called json new line it has a name with an index and fields
They are curly brackets with that contain the name type Analyser and fields the name type and Eliza and fields each have their own value name is content type is EDM dot string Analyser is standard dot loosen and then filter square brackets this is repeated three times with different values depending on the field The name has the value of content phrases and image tags in each of the three whatever you call them content being first phrase as being sick and image dad’s been that type first sydm string type S is collection idiom string type series collection idiom string
Analyser type 1 standard loosen Analyser type 2 standard loosen Analyser type 3 the same field as just feels with empty []
Index attributes:
As the cognitive search needs to know how you would like to search and display the fields in the documents.
You specify that by assigning attributes or behaviours to these fields.
For each field in the document the index stores its name the data type and they supported behaviours for the fields such as is the field searchable or can the field be sorted
The most efficient index is used only the behaviours that are needed for stop
If you forget to set a required behaviour on a field when designing the only way to get that feature is to rebuild the index.
Use an indexer to build an index:
In order to index the documents in Azure storage 32 be exported from the original file type to Jason.
In order to export data in any format to Jason and loaded into an index we use it indexer.
New line to create such documents you can either generate Jason documents with application code or you can use Asia indexer to export incoming documents into Jason.
Has your cognitive search lets you create and load Jason documents into an index with two approaches kola
Push method Caroline Jason data is pushed into a search index via either the rest API or the dotnet sdk.
Pushing the actor has the most flexibility as it has no restrictions on the data source type location or frequency of execution.
Full method turn on such services indexes can pull data from Popular as a data sources and if necessary export that data into Jason if it isn’t ready in that format.
Useful method to load data with an indexer:
80 cognitive searches indexer is a crawler that extracts searchable text and meta data from an external as a data source and populates a search index using field to field mapping between source data and your index.
Using the index is sometimes referred to as a formal approach because the service pools.in without you having to write any code that adds data to an index.
And index the maps source to their matching fields in the index.