apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about this month.
Apache Spark - A unified analytics engine for large-scale data processing
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Rocket Chip Generator
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
♞ lichess.org: the forever free, adless and open source chess server ♞
The Scala 3 compiler, also known as Dotty.
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
An open protocol for secure data sharing
Modern Load Testing as Code
Spark: The Definitive Guide's Code Repository
CMAK is a tool for managing Apache Kafka clusters
Scala language server with rich IDE features 🚀