Data Lake Template for Reference Architecture, AWS, Azure
Download .dragon1 file
NOTE: If you click the .dragon1 file to open it, Windows will likely ask which app to associate with the .dragon1 extension. If possible, choose Notepad in the dialog.

What is a Data Lake? (Definition)

A data lake is a system or repository in which data is stored in its original (raw) format, usually as files.

Often a data lake is a single store of all enterprise data, including raw copies of source system data and transformed data used for tasks such as reporting, visualization, advanced analytics, and machine learning. Because it keeps large volumes of raw data available for analysis and model training, it is an important architecture concept for artificial intelligence solutions.
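To make the idea of storing data "in its original (raw) format" concrete, here is a minimal Python sketch that lands a source file in a raw zone without parsing or converting it. The folder layout (raw/<source system>/<date>/) and the function name land_raw_file are assumptions made for illustration; they are not part of Dragon1, AWS, or Azure.

```python
import shutil
import tempfile
from datetime import date
from pathlib import Path


def land_raw_file(source: Path, lake_root: Path, source_system: str, day: date) -> Path:
    """Copy a source file into the raw zone without changing its content."""
    # Hypothetical layout: <lake_root>/raw/<source_system>/<YYYY-MM-DD>/<file name>
    target_dir = lake_root / "raw" / source_system / day.isoformat()
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / source.name
    shutil.copy2(source, target)  # plain file copy: no parsing, no conversion
    return target


if __name__ == "__main__":
    # Demo in a temporary directory so nothing is left behind.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        export = root / "orders.csv"
        export.write_text("id,amount\n1,9.50\n")
        landed = land_raw_file(export, root / "lake", "webshop", date.today())
        print(landed.relative_to(root))
```

The file is copied as-is, so any later transformation can always start again from the untouched original.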

Create and View .dragon1 Files

On the Dragon1 platform you can work in a repository application and in a designer application. Dragon1 also lets you work with .dragon1 Files. Below you see a screenshot of the Visual Designer.

Any CSV file and any data in the Dragon1 repository can be converted into, imported as, and exported as .dragon1 Files. Below is an example screenshot of a .dragon1 File.

Here is a help page on the .dragon1 File structure:

Data Lake vs Data Warehouse

A data lake can contain structured data from relational databases (in rows and columns or object-oriented nodes), semi-structured data (such as XML, JSON, CSV, and logs), unstructured data (such as PDFs, documents, and email), and binary data.
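As a small illustration of these categories, the Python sketch below sorts file names into the four kinds of data by extension. The extension-to-category mapping is an assumption for illustration only (for example, treating .parquet exports as structured); real lakes usually rely on richer metadata than a file extension.

```python
from pathlib import PurePath

# Assumed, illustrative mapping from file extension to data category.
CATEGORY_BY_EXTENSION = {
    ".parquet": "structured",      # e.g. a table exported from a relational database
    ".xml": "semi-structured",
    ".json": "semi-structured",
    ".csv": "semi-structured",
    ".log": "semi-structured",
    ".pdf": "unstructured",
    ".docx": "unstructured",
    ".eml": "unstructured",
}


def categorize(file_name: str) -> str:
    """Return the data category for a file, defaulting to 'binary'."""
    extension = PurePath(file_name).suffix.lower()
    return CATEGORY_BY_EXTENSION.get(extension, "binary")


if __name__ == "__main__":
    for name in ["orders.csv", "contract.PDF", "customers.parquet", "scan.jpg"]:
        print(f"{name}: {categorize(name)}")
```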

Data lakes and data warehouses are both widely used for storing big data, but they are not interchangeable. A data lake is typically a pool of data in its raw, original format, whose purpose is not yet defined. A data warehouse is a repository for structured, filtered data that has already been processed for a specific purpose.
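The difference can be sketched in a few lines of Python. In this simplified illustration the lake keeps every incoming record exactly as received, while the warehouse load keeps only records that parse cleanly into a fixed set of typed columns. The sample records, column names, and function names are hypothetical.

```python
import json

# Hypothetical incoming records, exactly as a source system might send them.
RAW_EVENTS = [
    '{"order_id": 1, "amount": "19.99", "note": "gift"}',
    '{"order_id": 2, "amount": "n/a"}',
    "not json at all",
]


def store_in_lake(raw_lines):
    """Lake: keep every record unchanged, valid or not."""
    return list(raw_lines)


def load_into_warehouse(raw_lines):
    """Warehouse: keep only records that fit a fixed, typed schema."""
    rows = []
    for line in raw_lines:
        try:
            record = json.loads(line)
            rows.append({
                "order_id": int(record["order_id"]),
                "amount": float(record["amount"]),
            })
        except (ValueError, KeyError, TypeError):
            continue  # filtered out: does not match the warehouse schema
    return rows


if __name__ == "__main__":
    print(len(store_in_lake(RAW_EVENTS)), "records kept in the lake")
    print(load_into_warehouse(RAW_EVENTS))
```

Note that the extra "note" field and the two malformed records survive only in the lake, which is why the raw copy remains useful for purposes that were not foreseen when the warehouse schema was designed.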

Data Lake Architecture Diagram

The interactive example above is repeated below as a static diagram. This solution reference architecture diagram is an effective way of visualizing the data lake concept.

[Figure: data lake definition]

Azure (from Microsoft) and AWS (from Amazon) are two well-known cloud platforms. Both provide the capabilities that developers, data scientists, and analysts need to store data of any size, shape, and speed, and to do all types of processing and analytics across platforms and languages.


Symbols Set

If you purchase a Dragon1 user license, you have access to a modern set of symbols for creating a data lake architecture diagram, as well as a data warehouse diagram or any artificial intelligence solution diagram.

[Figure: Amazon AWS data lake symbols set]

Amazon (AWS) Data Lake Symbols for Solution Architecture

You can use the Amazon (AWS) symbols to create, for instance, a solution architecture for your AWS data lake, like the one below.

[Figure: AWS data lake solution architecture]

Tutorial: How to create a Data Lake Architecture Diagram

Creating a diagram for an Azure data lake takes the following steps:

  • Log in to the platform
  • Upload your .CSV data with the Import application on the platform
  • Optionally enrich your data in the Architecture Repository application
  • Select the template in the Visual Designer
  • Link your data (model) to the template
  • Optionally create some views for your data in the Visual Designer application
  • Publish your diagram to the Viewer application
  • Inform your stakeholders that a new diagram is available for them to comment on and annotate, and tell them how to access it (for example, a URL they can open on their smartphone, iPad, or laptop)
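Before the upload step, the data needs to exist as a .CSV file. The Python sketch below shows one way to write a small list of data lake components to CSV text. The column names (name, type, platform) and the sample rows are hypothetical and are not Dragon1's required import format; check the platform's import documentation for the columns it expects.

```python
import csv
import io

# Hypothetical component list; names and columns are for illustration only.
COMPONENTS = [
    {"name": "Source systems", "type": "Data source", "platform": "On-premises"},
    {"name": "Raw data store", "type": "Storage", "platform": "Cloud"},
    {"name": "Analytics", "type": "Processing", "platform": "Cloud"},
]


def to_csv(rows, columns):
    """Serialize a list of dicts to CSV text with a header row."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(buffer, fieldnames=columns)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(COMPONENTS, ["name", "type", "platform"]))
```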

Data Lake Azure Scenarios

Below you see one of the many storage scenarios possible on Azure, Microsoft's cloud platform.

[Figure: Azure data lake architecture scenario]

Dragon1 Viewer

Below you see the JavaScript resources for the Dragon1 Viewer. You can either use the viewer on the website or install it locally.