top of page
Open Tools
301.png
Languages

Programming is no longer a skill exclusive to computer scientists. Many professionals across industries will find themselves interacting with software and applications programmatically using a variety of different languages. Here are the most popular open-source programs and their languages in use by data scientists today.

PythonIcon.png
R.png
SQLicon.png
Scalaicon.png
juliaicon.png

Python

R

Structured Query Language

Scala

Julia

See below in databases

Notebooks

Notebooks are interactive computational environments that allow users to develop and share data science projects through web-based technology. Notebook documents contain live code, narrative text, visualizations and equations and have become popular tools among data scientists, data engineers, and software developers.

Jupytericon.png
rmarkdownicon.png
colabicon.png

Jupyter Notebook

R Markdown

Google Colab - a Jupyter notebook hosted in the cloud

Databases

Data scientists need to work with tools, data sets, and databases. Using languages such as Python and SQL, a data scientist will access databases through programs and/or notebooks. Below is a list of openly available databases in use today.

SQLiteicon.png
MySQLicon.png
PostgreSQL.png

SQLite

MySQL

MariaDB

PostgreSQL

or

due to it's acquisition by Oracle, some of the original developers forked MySQL into MariaDB, committed to staying open.

mariadbicon.png
SQL
mongodbicon.png
NOSQL
CassandraDBicon.png
Hbaseicon.png

MongoDB

Apache CassandraDB

Apache HBASE

  • instagram
  • linkedin
  • twitter

Join the Community!

White on Transparent.png
bottom of page