delta-io / delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
See what the GitHub community is most excited about today.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
Apache Spark - A unified analytics engine for large-scale data processing
Modern Load Testing as Code
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Compile-time Language Integrated Queries for Scala
The pure asynchronous runtime for Scala
Source code for Twitter's Recommendation Algorithm
An open protocol for secure data sharing
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Spark: The Definitive Guide's Code Repository
The Scala 3 compiler, also known as Dotty.
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
sbt, the interactive build tool
workbench identity and access management
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
Open-source high-performance RISC-V processor
Scala language server with rich IDE features 🚀
A Spark plugin for reading and writing Excel files