Quantico: Data Wrangling
Quantico Shiny App

Quantico: Data Wrangling

Quantico offers a wide range of operations that assist in the data science & analytics process. One of those major components is data wrangling and that's what I'll be covering in this article. I classify data wrangling as separate from feature engineering which I'll cover in another article. So keep an eye out for that one later on.

Data wrangling often times begins and ends with SQL but with strong data wrangling packages available it can make sense to shift those tasks to R or Python. Data pipelines can be created more quickly, can be created with more flexibility, can be more dynamic, and can be more easily reused with the storage of functions that are common across projects.

With Quantico, data wrangling makes use of the R data.table package under the hood and a user has a variety of tasks available to them that I categorize into five buckets (listed below). The methods list is certainly not comprehensive. If you would like to see additional methods made available, please make requests on the GitHub repo in the issues section: https://github.com/AdrianAntico/Quantico/issues.

Lastly, after you run your data wrangling operations, feel free to jump to the Code Print output tab to retrieve the code that was used to run the operations. If you don't know data.table this is an easy way to help get up to speed.

Top Level Data Wrangling Categories

  • Shrink
  • Grow
  • Datasets
  • Pivot
  • Columns

Article content
Data Wrangling Methods Categories

Shrink

  1. Aggregate data (8 stats methods available)
  2. Subset rows
  3. Subset columns
  4. Sampling & Stratified Sampling

Grow

  1. Join (8 types including rolling)
  2. Union

Datasets

  1. Partition Data
  2. Sort Data
  3. Model Data Prep
  4. Remove Data

Pivot

  1. Cast from long to wide
  2. Melt from wide to long

Columns

  1. Type Casting
  2. Rename Columns
  3. Add a Time Trend Column
  4. Concatenate Columns



To view or add a comment, sign in

More articles by Adrian Antico

  • Python QuickEcharts

    Creating Echarts visualizations in Python is doable but a bit cumbersome. The basis package and API I'm utilizing under…

    15 Comments
  • Quantico: Multiclass Evaluation Plots

    One of the great benefits from the R Shiny App Quantico (dependent upon the package AutoPlots) are the model evaluation…

    4 Comments
  • Quantico: Plotting

    The first article I wrote on plotting can be found here: https://www.linkedin.

  • Quantico: Code Generation Part_2 - Data Wrangling

    With the Quantico Shiny App (https://github.com/AdrianAntico/Quantico) you can return the code that runs behind the…

    4 Comments
  • Quantico: Code Generation Part_1 - Plotting

    With the Quantico Shiny App (https://github.com/AdrianAntico/Quantico) you can return the code that runs behind the…

    11 Comments
  • Quantico: Hypothesis Testing

    With the Quantico Shiny App, you can run hypothesis testing and generate output to help you with decision making and…

  • Quantico: Forecasting Panel & Single Series Data

    I've written several articles so far that highlights some top level functionality with the Quantico Shiny App. You can…

    1 Comment
  • Quantico: Machine Learning

    The new shiny app Quantico comes with machine learning capabilities. Check out the GitHub repo here: https://github.

    3 Comments
  • Quantico: Unsupervised Learning

    Quantico offers a wide range of operations that assist in the data science & analytics process. One of those major…

  • Quantico: Feature Engineering

    Quantico offers a wide range of operations that assist in the data science & analytics process. One of those major…

Explore content categories