Spark is written in Scala which is a functional programming language. Spark can be accessed with Python which is a procedural programming language but you will see an influence of functional programming in Spark. Functional programming is better for a distributed system. Functional programming reduces the possibilities of error by…

In big data terms, it is better to use a distributed system with several machines rather than doing the data processing in a single machine. It is better to understand the capabilities of the hardware like CPU, memory, SSD, and network to get a better knowledge about the big data…

A data warehouse is a system that enables us to store the data and support analytical processing. The data warehousing can be seen from the business perspective and technical perspective.

First, let us see data warehousing from a business perspective. Let us consider an online retail store like Amazon, there…

The data model is an abstraction that organizes the elements of data and how they are related to each other. Data modeling helps to organize the data in the databases. In the conceptual data modeling, the data are related based on the entities. It means the data is related based…

Relational Databases

Organization of data is very important to retrieve data from the databases. It helps us to use the data later. Creating a data model is an iterative process. You can alter the tables based on the data availability and redundancy in the data. Data modeling is used by the software…

Data engineering is the process of collecting data from different sources and finding an efficient way to store the data. Data engineers also run the data pipelines to transform and move the data from one place to another. The data is prepared by cleaning the data and made available to…

Merge sort algorithm is based on the divide and conquer algorithm. Although this algorithm consumes more memory, this algorithm is widely used nowadays because of its less complexity.

Below are the few important things to consider while using this algorithm,

  1. Merge sort is efficient to use when the data set…

Binary search algorithm which is also known as the logarithmic algorithm is an algorithm and it is used in finding an item from the sorted array. The binary search algorithm is faster than a linear search algorithm. Make sure you remember that the array must be sorted for the algorithm…

You might come across interviews where you could not crack a coding question or sometimes you might be preparing for a technical coding interview. It is very important that you must communicate with the interviewer at every step in the process of solving a problem. He/She might give you some…

Abinaya Rajesh

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store