Anssr Data Platform

Anssr Data Platform

  • Docs
  • Support
  • Help
  • Blog
  • Contact

›Analytics

Getting Started

  • Deploying an Anssr Bundle
  • Getting Started with Anssr Analytics
  • Getting Started with Anssr and Hadoop
  • Installing Juju
  • Getting Started with Hue

Analytics

  • Creating A Saiku Schema
  • Saiku Report Generation
  • What Is OLAP?
  • Using Apache Drill

Hadoop

  • Map Reduce 101
  • HDFS and Distributed Data
  • Hadoop Interfaces

Druid

  • Data Ingestion With Druid and Hadoop

Platform Guides

  • Setting an SLA

Further Help

  • About Anssr IO
  • Frequently Asked Questions
  • Getting Support

LXC & LXD Containers

  • About LXC & LXD

What Is OLAP?

OLAP, which stands for Online Analytics Processing is a category of database processing to assist Business Intelligence. It enables organisations to make effective decisions about the strategic direction of the company by providing insights into performance and highlighting areas where improvements are needed.

OLAP tools are optimised for querying and reporting, unlike OLTP (Online Transactional Processing) which primarily focus on transaction based tasks. By using OLAP tools, it allows its users to analyse multidimensional data interactivity from many different perspectives. Find trends in your data by getting a big picture view from across the organisation as a whole and get assistance with future decision making by creating “what if” scenarios.

OLAP vs OLTP

In usual use cases OLAP would involve vast amounts of data with large queries that run for far longer than a typical OLTP solution. The primary focus for OLTP is speed, accuracy and the ability to support large numbers of users. OLAP on the other hand is focused on complex aggregate queries across much larger data sets.

OLTPOLAP
Mostly updatesMostly reads
Large amount of small transactionsQueries long, complex
Mb - Tb of dataGb - Tb of data
Raw dataSummarised, consolidated data
Clerical usersDecision-makers, analysts as users
Up-to-date dataHistorical data
Consistency, recoverability criticalAccuracy

Populating an OLAP Database

OLAP databases will usually be populated via a batch query. By this we mean that all the data gets inserted at one time. In contrast to this, OLTP applications constantly have data being uploaded, updated and removed. OLAP and OLTP databases are used concurrently as you will often find that the data that populates an OLAP database has originated from an OLTP application. Batch queries are used to search the system from where the data comes from (OLPT) and then imports it into an OLAP database.

OLAP Cube

At the core of any OLAP tool is the OLAP cube. The way that the data is represented is arranged into a cubic shape, represents how OLAP follows the principle of multidimensional analysis which delivers a data structure designed for fast and effective data analysis which is in contrast to the limitations found when using relational databases.

OLAP cubes are made up of dimensions and measures:

Dimensions

A dimension is a perspective through which to analyse the data that resides within an OLAP cube. Each dimension is a cube is built on fields or columns found in the data. A dimension can be composed of a single attribute, or more. The data can exist on a single level of granularity or on multiple levels stored as a hierarchy.

Measures

A measure is a numeric piece of information attached to business fact that has occured or a record that is stored. Measures, along with dimensions are the foundation of most data analysis. Simply put, measures provide the number part of a cube while the dimensions provide the description. Measures are recorded individually and can then be aggregated to higher levels.

Popular OLAP Tools

  • Saiku Analytics
  • Microsoft Analysis Services
  • Sisense
  • Inetsoft
  • Halo
  • Apache Kylin
← PreviousNext →
  • Dimensions
  • Measures
Docs
Getting StartedGuides
Community
Anssr CommunitySlack Channel
More
BlogGitLab
Follow @spiculedata
Anssr Data Platform logo
Copyright © 2018 Anssr