Queries Flashcards by Giorgenes G

How do you do an analysed query (full text)?

GET _search
{
  "query": {
    "match": {
      "text_entry": "< text to search >"
    }
  }
}

What’s the maximum number of HITS a query can have?

10,000 by default (the index.max_result_window setting, which caps from + size).

If you want more you need to use the scroll API.
TODO: What is the scroll API?

What is the relevancy score?

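A score of how relevant a document is to a given search.
It matters mostly for analysed searches (match query). Term queries run in query context still get a score, but it is rarely meaningful for exact matches, and queries run in filter context are not scored at all.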

How is the relevancy score calculated?

TODO: Learn more about this, is this required for the certification?

What’s the difference between match and match_phrase?

“match_phrase” requires all the words of the search text to appear in the field, adjacent and in the same order, while “match” matches documents that contain any of the words in the search text.
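
To make the difference concrete, here is a minimal Python sketch that imitates both behaviours on an in-memory string. It is a toy model, not how Elasticsearch works internally: the analyse function (lowercase, split on whitespace) is a hypothetical stand-in for a real analyser, and scoring is ignored.

```python
def analyse(text):
    """Toy analyser (an assumption): lowercase and split on whitespace."""
    return text.lower().split()


def match(field_text, search_text):
    """Toy "match": true if ANY search token occurs in the field."""
    field_tokens = set(analyse(field_text))
    return any(token in field_tokens for token in analyse(search_text))


def match_phrase(field_text, search_text):
    """Toy "match_phrase": true if ALL search tokens occur adjacent and in order."""
    field_tokens = analyse(field_text)
    search_tokens = analyse(search_text)
    n = len(search_tokens)
    if n == 0:
        return False
    return any(field_tokens[i:i + n] == search_tokens
               for i in range(len(field_tokens) - n + 1))


doc = "To be or not to be that is the question"
print(match(doc, "question answer"))         # True: "question" is present
print(match_phrase(doc, "question answer"))  # False: the phrase never occurs
print(match_phrase(doc, "not to be"))        # True: adjacent and in order
print(match_phrase(doc, "be to not"))        # False: same words, wrong order
```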

What analyser is used by default when doing a query?

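The analyser specified for the field in the index mappings (the standard analyser if none is set). A search_analyzer in the mapping or an analyzer parameter in the query overrides it.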

How to do a simple multi match query?

GET _search
{
  "query": {
    "multi_match": {
      "query": "< text to search >",
      "fields": ["< field to search >", ....]
    }
  }
}

What is the “query string” query type?

Performs a query using the “query string” syntax, which allows boolean operators inside the query text, like “this OR that”.

GET _search
{
  "query": {
    "query_string": {
      "default_field": "< field to search >",
      "query": "< boolean query syntax >"
    }
  }
}

TODO: Learn more about this query DSL.

How do you do a term level query?

GET _search
{
  "query": {
    "term": {
      "< field name >": {
        "value": "< keyword to search >"
      }
    }
  }
}

What’s the difference between term and match searches?

Term searches match the whole field value exactly and are typically used with “keyword” type fields, while match queries are analysed: the search text is tokenized and compared against the analysed tokens of the field in the document, thus allowing matches on individual words.

How do you search for multiple terms on the same field?

GET _search
{
  "query": {
    "terms": {
      "< field name >": [
        "< value 1 >",
        ....,
        "< value n >"
      ]
    }
  }
}

How do you do a numerical range search?

GET _search
{
  "query": {
    "range": {
      "< field name >": {
        "gte": < value >,
        "lte": < value >
      }
    }
  }
}

TODO: Learn more about this query.

What fields can the range query operate on?

Numeric and date fields, most commonly. It also works on IP fields and, as a lexicographic range, on keyword and text fields.

How do you do a wildcard term query?

GET _search
{
  "query": {
    "wildcard": {
      "< field name >": {
        "value": "< wildcard term >"
      }
    }
  }
}

TODO: What’s the performance impact on this?

How do you do a regex term query?

GET _search
{
  "query": {
    "regexp": {
      "< field name >": "< regexp >"
    }
  }
}

How do you create compound queries?

GET _search
{
  "query": {
    "bool": {
      "must": [< term or match query, etc >],
      "must_not": [< term or match query, etc >],
      "should": [< term or match query, etc >]
    }
  }
}

How does a “must” compound query work?

Returns only documents that match ALL queries in the “must” block (an AND query). Every clause also contributes to the relevancy score.

How does a “must_not” compound query work?

Excludes any document that matches the “must_not” block.

How does a “should” compound query work?

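Provides optional queries: documents that match them get a higher relevancy score, but they are not required to match. When the bool query also has a “must” or “filter” block, “should” only affects the score; when it has neither, at least one “should” query has to match.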

How do you make an “OR” query using the “should” block in a compound query?

Set the “minimum_should_match” option. This will require that at least N of those queries match. If you set it to 1 it will effectively work as an “OR” type of query.
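
The must / must_not / should rules from the cards above can be sketched as a small Python predicate. This is only a toy model of the matching logic under stated assumptions: clauses are plain Python functions, filter blocks and scoring are left out, and bool_matches and has_word are hypothetical names, not part of any Elasticsearch client.

```python
def bool_matches(doc, must=(), must_not=(), should=(), minimum_should_match=None):
    """Toy model of bool query matching; each clause is a predicate on the document."""
    if minimum_should_match is None:
        # Default: 1 when there are only "should" clauses, otherwise 0
        # ("filter" clauses are not modelled in this sketch).
        minimum_should_match = 1 if should and not must else 0
    if not all(clause(doc) for clause in must):
        return False  # every "must" clause has to match (AND)
    if any(clause(doc) for clause in must_not):
        return False  # any "must_not" match excludes the document
    matched_should = sum(1 for clause in should if clause(doc))
    return matched_should >= minimum_should_match


def has_word(word):
    """Hypothetical clause factory: matches documents containing the word."""
    return lambda doc: word in doc.lower().split()


docs = ["red apple", "green apple", "red car", "blue bike"]

# "red" OR "apple": should clauses with minimum_should_match = 1
either = [d for d in docs
          if bool_matches(d, should=[has_word("red"), has_word("apple")],
                          minimum_should_match=1)]
print(either)  # ['red apple', 'green apple', 'red car']
```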

How do you use a filter and what is it for?

Put the queries in the “filter” block of a bool query. Filters work the same as queries but they don’t affect the relevancy score, and their results can be cached, so they are more efficient.

Same syntax as query.

How do you name a query and what do you use it for?

Add it to your query like so:

"term": {
  "field": {
    "_name": "< query name >",
    "value": ....
  }
}

This will add a “matched_queries” field to each hit in your search results so you know which queries matched each document.

How do you highlight the matching words?

GET _search
{
  "query": ....,
  "highlight": {
    "pre_tags": ["< tag1 >"],
    "post_tags": ["< /tag1 >"],
    "fields": {
      "< field to highlight >": {}
    }
  }
}

This will add a “highlight” field to the results.

How do you sort the results of a query?

GET _search
{
  "sort": [
    {
      "< field >": {
        "order": "< asc | desc >"
      }
    }
  ]
}

TODO: how does it impact score?

How do you do pagination in your queries?

It's paginated by default.

GET _search?size=< page size >&from=< offset >

```
GET _search
{
  "size": < page size >,
  "from": < offset >,
  "query": ....
}
```
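
As a rough illustration of how a page number maps onto from and size, here is a small Python helper that builds the request body. The function name page_body and the 1-based page numbering are assumptions made for this sketch; the 10,000 limit is the default result window mentioned earlier in the deck.

```python
MAX_RESULT_WINDOW = 10_000  # default limit on from + size


def page_body(page, page_size, query):
    """Build a search body for a 1-based page number (hypothetical helper)."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be >= 1")
    offset = (page - 1) * page_size
    if offset + page_size > MAX_RESULT_WINDOW:
        raise ValueError("from + size exceeds the result window; use the scroll API")
    return {"from": offset, "size": page_size, "query": query}


print(page_body(3, 20, {"match_all": {}}))
# {'from': 40, 'size': 20, 'query': {'match_all': {}}}
```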

What's the default page size for Elasticsearch?

10 hits (the default value of the size parameter).

What is the scroll API, how does it work and what do you use it for?

- Search is limited to 10k documents by default.
- With scroll you set a time window during which the search context is kept alive, so you can keep fetching batches of results.

```
# To initiate the scroll
GET _search?scroll=10m&size=< scroll size >
```

This will return a "_scroll_id".

To continue fetching from the scroll:

GET _search/scroll
{
  "scroll": "10m",
  "scroll_id": "< scroll id >"
}

Deleting the scroll:

DELETE _search/scroll
{
  "scroll_id": "< scroll id >"
}
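
The initiate / continue / delete cycle can be sketched as a Python generator. To stay self-contained it does not talk to a cluster: start, fetch_next and clear are hypothetical callables you would wire to the three requests above, and the fake pages only mimic the shape of a response (a "_scroll_id" plus "hits.hits").

```python
def scroll_all(start, fetch_next, clear):
    """Yield every hit of a scroll, then delete the scroll.

    start() returns the first page, fetch_next(scroll_id) the next page,
    clear(scroll_id) deletes the scroll (all three are assumptions).
    """
    page = start()
    scroll_id = page["_scroll_id"]
    try:
        while page["hits"]["hits"]:       # an empty page means the scroll is done
            yield from page["hits"]["hits"]
            page = fetch_next(scroll_id)
            scroll_id = page["_scroll_id"]  # always use the most recent id
    finally:
        clear(scroll_id)                  # free the search context


# Fake responses standing in for a real cluster.
pages = iter([
    {"_scroll_id": "abc", "hits": {"hits": ["doc1", "doc2"]}},
    {"_scroll_id": "abc", "hits": {"hits": ["doc3"]}},
    {"_scroll_id": "abc", "hits": {"hits": []}},
])
cleared = []
hits = list(scroll_all(lambda: next(pages), lambda sid: next(pages), cleared.append))
print(hits)     # ['doc1', 'doc2', 'doc3']
print(cleared)  # ['abc']
```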

What are some best practices for the scroll API?

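- Sort your results by "_doc" (index order, not document id) when the order doesn't matter; it is the most efficient sort for a scroll.
- Delete the scroll after use to free its resources.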

How do you close all scrolls at once?

DELETE _search/scroll/_all

How do you slice your scroll (run in parallel)?

```
GET _search?scroll=10m
{
  "slice": {
    "id": 0,
    "max": < max number of slices >
  }
}
```

```
# One request per slice id, from 0 up to the number of slices minus 1
GET _search?scroll=10m
{
  "slice": {
    "id": < N - 1 >,
    "max": < max number of slices >
  }
}
```

Create a scroll for each "slice", so you can fetch from each scroll in parallel.
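
A short Python sketch of generating one request body per slice, which you could then hand to parallel workers. The helper name sliced_scroll_bodies is hypothetical and the sketch only builds the bodies; sending them is left out.

```python
def sliced_scroll_bodies(max_slices, query):
    """Return one search body per slice, with ids 0 .. max_slices - 1."""
    return [
        {"slice": {"id": slice_id, "max": max_slices}, "query": query}
        for slice_id in range(max_slices)
    ]


for body in sliced_scroll_bodies(3, {"match_all": {}}):
    print(body["slice"])
# {'id': 0, 'max': 3}
# {'id': 1, 'max': 3}
# {'id': 2, 'max': 3}
```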

How to do fuzzy searches in elastic?

```
# match type
GET _search
{
  "query": {
    "match": {
      "< field name >": {
        "query": "< text to search >",
        "fuzziness": < fuzziness >
      }
    }
  }
}
```

```
# term type
GET _search
{
  "query": {
    "fuzzy": {
      "< field name >": {
        "value": "< value to search >",
        "fuzziness": 1,
        "transpositions": < true | false >
      }
    }
  }
}
```

TODO: Read more in the documentation.

What does the fuzziness param mean in a fuzzy query?

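It's the maximum edit distance allowed between the search term and a matching term: the number of single-character changes (insertion, deletion, substitution and, optionally, transposition) needed to turn one into the other. It can be 0, 1, 2 or AUTO. TODO: Read more about this.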

What is transposition in a fuzzy query? Give an example.

Whether swapping two adjacent characters (ab → ba) counts as a single edit. For example, with a fuzziness of 1:

```
transpositions = true: "teh" matches "the"
transpositions = false: "teh" doesn't match "the" (the swap counts as two edits)
```
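
To see why the flag changes the result, here is a minimal Python sketch of an edit distance with an optional adjacent-swap rule. It is a simplified textbook variant (optimal string alignment), written for illustration and not taken from Lucene or Elasticsearch; the function name edit_distance is an assumption.

```python
def edit_distance(a, b, transpositions=True):
    """Edit distance between a and b.

    With transpositions=True an adjacent swap costs 1 edit
    (optimal string alignment); otherwise it costs 2 substitutions.
    """
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert all of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution or match
            if (transpositions and i > 1 and j > 1
                    and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)  # adjacent swap
    return d[rows - 1][cols - 1]


print(edit_distance("teh", "the", transpositions=True))   # 1
print(edit_distance("teh", "the", transpositions=False))  # 2
```

With a fuzziness of 1, only the first case is within the allowed distance.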

How do you create a template query?

```
# Test the query
GET _search/template
{
  "source": {
    "query": {
      .....
      "value": "{{ param name }}"
    }
  },
  "params": {
    "< param name >": "< value >"
  }
}
```

```
# Save the query
POST _scripts/< query name to save >
{
  "script": {
    "lang": "mustache",
    "source": {
      ....
    }
  }
}
```

```
# Using the query
GET _search/template
{
  "id": "< template name >",
  "params": {
    ....
  }
}
```

TODO: What other types of scripts/languages can I create?

How do you set default values in a template query?

.... "value": "{{ param }}{{^param}}< default value >{{/param}}" ....
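
The default-value trick relies on a Mustache inverted section ({{^param}} ... {{/param}}), which renders only when the parameter is missing or falsy. The Python sketch below imitates just that subset with regular expressions to show the effect. It is a hypothetical toy renderer, not a full Mustache implementation and not what Elasticsearch runs.

```python
import re


def render(template, params):
    """Render a tiny Mustache subset: {{name}} and {{^name}}...{{/name}}."""
    def inverted(m):
        name, body = m.group(1), m.group(2)
        # An inverted section is kept only when the value is missing or falsy.
        return "" if params.get(name) else body

    out = re.sub(r"\{\{\^\s*(\w+)\s*\}\}(.*?)\{\{/\s*\1\s*\}\}",
                 inverted, template, flags=re.S)
    # A missing variable renders as an empty string.
    return re.sub(r"\{\{\s*(\w+)\s*\}\}",
                  lambda m: str(params.get(m.group(1), "")), out)


template = '"value": "{{ param }}{{^param}}fallback{{/param}}"'
print(render(template, {"param": "hamlet"}))  # "value": "hamlet"
print(render(template, {}))                   # "value": "fallback"
```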

How do you perform a remote cluster search?

GET < cluster name >:< index name>/_search

Queries Flashcards

(36 cards)