
PAMELA KINGA @
THE BIG DATA LIFE
Big data trends | tools | services

Programming is no longer a skill exclusive to computer scientists. Professionals across many industries now find themselves working with software and applications programmatically, in a variety of languages. Here are the most popular open-source languages and tools in use by data scientists today.




Python
R
Structured Query Language (SQL) - see the databases below
Scala
Julia
Notebooks are interactive computational environments that allow users to develop and share data science projects through web-based technology. Notebook documents contain live code, narrative text, visualizations and equations and have become popular tools among data scientists, data engineers, and software developers.

Jupyter Notebook
R Markdown
Google Colab - a Jupyter notebook hosted in the cloud
Data scientists need to work with tools, data sets, and databases. Using languages such as Python and SQL, a data scientist will access databases through programs and/or notebooks. Below is a list of openly available databases in use today.
SQLite
MySQL
MariaDB - a fork of MySQL, created by some of MySQL's original developers after it came under Oracle's ownership and committed to staying open
PostgreSQL
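As a rough illustration of using Python and SQL together to access a database, here is a minimal sketch that runs in a script or a notebook cell. It uses SQLite through Python's built-in sqlite3 module. The table name, the rows, and the helper function names_in_category are made up for this example and are not taken from any real data set.

```python
import sqlite3

# Open an in-memory SQLite database, so nothing is written to disk.
# The "tools" table and its rows are hypothetical sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tools (name TEXT, category TEXT)")
conn.executemany(
    "INSERT INTO tools (name, category) VALUES (?, ?)",
    [
        ("Python", "language"),
        ("R", "language"),
        ("SQLite", "database"),
        ("PostgreSQL", "database"),
    ],
)
conn.commit()


def names_in_category(connection, category):
    """Return the tool names in one category, sorted alphabetically."""
    rows = connection.execute(
        # The ? placeholder passes the value safely instead of
        # pasting it into the SQL string.
        "SELECT name FROM tools WHERE category = ? ORDER BY name",
        (category,),
    ).fetchall()
    return [name for (name,) in rows]


print(names_in_category(conn, "database"))
```

The same pattern of connecting, running a SQL query, and fetching the rows carries over to MySQL, MariaDB, and PostgreSQL, although each of those needs its own separately installed driver package rather than the built-in module.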