Natural language processing and how Power BI uses it

What is Natural Language?

Natural language refers to the way we, humans, communicate with each other, either in speech form or text.

In today’s world, only 21% of data is in structure form. Most of the data is in text form, like whatever we tweet, message comment, etc., which is unstructured.

Introduction

Computers are great at working with structured data like database tables and financial records. They are able to process that data much faster than humans. But we don’t communicate in “structured data” form. We, humans, communicate using words, which is an unstructured form.

However, there is no standardized technique to work with unstructured data. We program computers using C++, Python or R, we just give computers some set of rules that they should operate by.

NLP and its components:

NLP is a sub-field of AI that is focused on enabling computers to understand and process human language. It is basically the ability of a computer to understand, analyze, manipulate, and potentially generate human language. In general terms, NLP tasks break down language into shorter, elemental pieces, try to understand relationships between the pieces, and explore how the pieces work together to create meaning.

Components of NLP:

Natural Language Understanding

It uses computer software to understand input language in the form of text/speech
It basically understands what a speaker actually means.

Natural Language Generation

Translate the computer’s artificial language into text/audible speech.
Basically, it turns pieces of information into human language.

Major steps involved:

Entity extraction

Entity extraction involves segmenting a sentence to identify and extract entities, such as a person (real or fictional), organization, date, geographies (country, cities, states, etc.), events, etc.
E.g., Amazon is an e-commerce company based in Washington.
Here, NLP will identify Amazon as an organization and Washington as a country.

Syntactic analysis

The syntactic analysis involves parsing a sentence to understand the grammar and co-relation between words in that sentence. This step does a ‘context-free grammar check’, which is independent of the overall context of the text. It refers to the proper ordering of words. It forms in a tree format.
e.g., The dog saw a man in the park.
- At first, the sentence will be divided into two parts, i.e., noun phrase and verb phrase.
- Then, from noun phrases, it identifies determiners and nouns, and from verb phrases, it will find determiners, nouns, and prepositions.
- ‘The’ will be classified as a determiner and ‘dog’ as a noun in the noun phrase, and in the verb phrase, ‘saw’ will be classified as a verb, ‘a’ as a determiner, ‘man’ as a noun, ‘in’ as a preposition, ‘the’ as determiners and ‘park’ as a noun.

Semantic analysis

Semantic analysis finds the meaning of the sentence in a context-free form as an independent sentence. The sentence is treated as independent of overall paragraph/text. On the basis of knowledge based on grammar and entities, it finds the meaning of the text.

Sentiment Analysis

Once the syntactic and semantic analysis has been completed, we need to understand the sentiment behind every sentence. There are two terms that are widely used to understand the sentiment of a statement:
- Polarity: In this part -1 and +1 that is used to understand the negativity or positivity of a sentence, respectively.
- Magnitude: The value of the magnitude ranges from 0 to infinity which signifies the weight of the assigned polarity.

Pragmatic analysis

Pragmatic analysis involves analyzing the statement in relation to the preceding or succeeding statements or even the overall paragraph in order to understand its meaning.
For example, I cooked lunch today. However, I forgot to have it.
In this case, the pronoun ‘it’ is referring to ‘lunch’. For the system to understand what I actually ‘forgot’, it is important for it to understand the earlier statement.

Applications of NLP:

Google Assistant
Spam filter in emails
Reviews of any application
Google Translation
Text analysis
Grammarly

NLP in PowerBI

PowerBI has recently made the ‘Ask a Question’ feature inside the Power BI desktop. This allows to directly place the ask a question box in your reports. This helps in finding useful insights from the data. This feature also auto-generates sample suggestions to help one quickly get started. It provides the result of the data in Power BI in the form of charts and graphs.
Power BI Q&A only supports answering queries asked in English. The Q&A feature looks for answers in all of the datasets that are updated in the dashboard.

To get the correct answer to your question, one should keep in mind a few essential steps.

At first, add all the missing relationships between tables to get the correct answer.
Rename tables and columns so that questions can be asked in an easy way.
Fix incorrect data types. E.g., Date, country, etc., should be in the correct format.

Power BI uses NLP for this question, and answers feature to help one to understand data in an easier form.
Nowadays, NLP make life easy for us by all these applications through which we can easily understand and do several things.

Written by: