
This is OpenMLDB's Spark distribution, which is optimized for feature extraction. It includes a few novel techniques, such as a native implementation of last join and multi-window parallelization. Its APIs are fully compatible with standard Spark. It is designed to be a component of OpenMLDB (https://github.com/4paradigm/OpenMLDB).

Language · Scala
License · Apache-2.0