apache / spark
Apache Spark - A unified analytics engine for large-scale data processing
See what the GitHub community is most excited about today.
Apache Spark - A unified analytics engine for large-scale data processing
Open-source high-performance RISC-V processor
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
Chisel: A Modern Hardware Design Language
The Community Maintained High Velocity Web Framework For Java and Scala.
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
The Streaming-first HTTP server/module of Akka
The Scala 3 compiler, also known as Dotty.
Protocol buffer compiler for Scala.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
A better build tool for Java, Scala and Kotlin: Simpler than Maven, easier than Gradle, with 3-7x faster dev workflows than other JVM build tools
CLI tool for coding agents and developers to query the public API of any Maven JVM dependency — get symbol signatures, list packages, search by name, and inspect dependency trees. Powered by Coursier and tasty-query.
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
Apache DataFusion Comet Spark Accelerator
The leader in Customer Data Infrastructure
Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Human-AI Collaborative Data Science Using Visual Workflows
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
sbt, the interactive build tool