Agile Data Science Workflows made easy with PySpark

Prepare, explore, visualize and create Machine Learning models for Big Data with the fastest open source library on the planet.

Get started fast:

pip install optimuspyspark

Features

What can Optimus do for you

  • Simple and Robust

    Prepare, explore, and visualize your data in a few lines of code.
  • Apache Spark and Python

    Easy, fast, parallelized, and scalable data cleansing, exploration, and machine learning model creation.
  • Local or Cloud

    On your laptop, on a local cluster, or in the cloud.

Connect to files and databases

Load and save Excel, CSV, JSON, Parquet, and Avro files. Read and insert data from MySQL, Redshift, SQL Server, Postgres, Oracle, Cassandra, and Presto.

Easy API

In a little more than 10 lines you can remove white space and accents in all columns, lowercase all column data, drop a "dummyCol" column, transform a date format, sort a column, convert integers to strings, and replace "taco" with "taaaccoo" and "pizza" with "pizzza".
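
The steps listed above can be sketched in plain Python. This is not the Optimus API: it uses only the standard library on a few in-memory rows, and the column names (id, name, food, birth), the sample values, and the date formats are assumptions made for illustration. Optimus would apply the same kind of operations to a Spark DataFrame.

```python
import unicodedata
from datetime import datetime

# Hypothetical sample rows; column names and values are illustrative only.
rows = [
    {"id": 2, "name": "  José ", "food": "Pizza", "birth": "1990/07/11", "dummyCol": "x"},
    {"id": 1, "name": "Ana María", "food": " TACO", "birth": "1985/01/30", "dummyCol": "y"},
]


def strip_accents(text):
    # Decompose accented characters, then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def clean_row(row):
    cleaned_row = {}
    for key, value in row.items():
        if key == "dummyCol":
            continue  # drop the unwanted column
        if isinstance(value, str):
            # Trim white space, remove accents, lowercase.
            value = strip_accents(value.strip()).lower()
        cleaned_row[key] = value
    # Change the date format from year/month/day to day-month-year.
    birth = datetime.strptime(cleaned_row["birth"], "%Y/%m/%d")
    cleaned_row["birth"] = birth.strftime("%d-%m-%Y")
    # Replace the values named in the text.
    food = cleaned_row["food"]
    cleaned_row["food"] = food.replace("taco", "taaaccoo").replace("pizza", "pizzza")
    return cleaned_row


cleaned = [clean_row(r) for r in rows]
# Sort on the still-numeric id, then convert it to a string.
cleaned.sort(key=lambda r: r["id"])
for r in cleaned:
    r["id"] = str(r["id"])
```

Sorting before the integer-to-string conversion keeps the order numeric; sorting afterwards would compare the ids as text.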

Profile and Visualize your data

In tandem with Bumblebee, Optimus lets you visualize histograms and frequency plots, and check nulls, missing values, and zeros from an easy-to-use interface.
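
To make the profiling idea concrete, here is a minimal plain-Python sketch of the counts such a profile rests on: nulls, missing values, zeros, and value frequencies. It is not the Optimus or Bumblebee API; the `profile` function, the sample column, and the convention that an empty string marks a missing value are all assumptions for illustration.

```python
from collections import Counter

# Hypothetical column: None marks a null, "" marks a missing entry.
prices = [12, 0, None, 7, 0, "", 12, 12]


def profile(values):
    # Count nulls, missing entries, and zeros separately.
    nulls = sum(1 for v in values if v is None)
    missing = sum(1 for v in values if v == "")
    zeros = sum(1 for v in values if v == 0)
    # Frequency of each real value, the data behind a frequency plot.
    frequency = Counter(v for v in values if v is not None and v != "")
    return {"nulls": nulls, "missing": missing, "zeros": zeros, "frequency": frequency}


report = profile(prices)
```

Here `report["frequency"].most_common()` gives the values ordered by how often they occur, which is what a frequency plot would display.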

Powerful features

All you need to handle your data in one place.

  • Data Enrichment

    You can connect to any external API to enrich your data using Optimus.
  • Machine Learning

    To apply a random forest, you just need to import the ML library and write one line of code.
  • String Clustering

    Cluster similar strings and replace them with a single value.
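
String clustering can be illustrated with key-collision clustering on a fingerprint key, one common approach to the problem. The sketch below shows that general technique in plain Python; it is not a confirmed description of how Optimus implements it, and the function names and sample data are hypothetical.

```python
import re
import unicodedata
from collections import Counter, defaultdict


def fingerprint(text):
    # Strip accents, lowercase, drop punctuation, then sort the unique tokens.
    decomposed = unicodedata.normalize("NFKD", text)
    plain = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    tokens = re.sub(r"[^\w\s]", "", plain.lower()).split()
    return " ".join(sorted(set(tokens)))


def cluster_and_replace(values):
    # Group values whose fingerprints collide.
    clusters = defaultdict(list)
    for value in values:
        clusters[fingerprint(value)].append(value)
    # Use the most frequent spelling in each cluster as the single value.
    canonical = {
        key: Counter(members).most_common(1)[0][0]
        for key, members in clusters.items()
    }
    return [canonical[fingerprint(value)] for value in values]


# Hypothetical sample data.
cities = ["New York", "new york", "York, New", "New York", "Bogotá", "bogota", "Bogotá"]
unified = cluster_and_replace(cities)
```

Choosing the most frequent spelling as the replacement is a design choice of this sketch; a real workflow would usually let the user confirm or override the value for each cluster.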

Used by Forward-Thinking Companies

Here are a few of our favorites!

What People Say

  • “The group of BBVA Data & Analytics in Mexico has been using Optimus for the past months, and we have boosted our performance for cleansing, exploring and analyzing our data by 10x factor.”

Featured On

Join Our Non-Intrusive Newsletter

Want to know about new releases and how you can help Optimus?

Exciting product updates.
The hottest stories from the blog.
Exclusive discounts and gifts.