Databricks AI Summit 2023 Block Session

Block

Block Company outlined a series of real-world examples from applications like Square (payments), Cash App, Tidal, and TBD.

Federation

The first building block is federation, which involves abstracting away the complexity by standardizing interfaces, allowing for seamless communication between different apps and databases. In this case, Databricks serves as an intermediate gateway, similar to the Unity catalog.

Data Streaming and Computation

Each app is set to send all data to Kafka (provided by Confluent) with computation provided by Databricks. Databricks not only provides a compute layer but also handles historical data, which fuels the machine learning platform.

Composability

Composability is the concept of having small, interacting components. It allows high-level problems to be broken down into functional pieces, contributing to more efficient problem-solving. Examples mentioned include Marketing ML for personalized business experiences and Risk ML for fraud detection.

Lakehouse Layers

The Lakehouse model consists of several layers. The foundation is federated storage connected to the compute layer (Databricks jobs and clusters). This is followed by a feature calculation platform that takes business problems and translates them into actionable values. Then, there is a model training layer and an MLFlow layer for features and model training, providing a standardized method of storing and processing data. The final layer is model serving.

Examples of Lakehouse Layers

The compute, and federated storage layers feature payment aggregation, while features such as connected users are used for calculating fraud patterns. Risk training exists on top of the MLFlow layer, and the marketing layer is placed on the model training layer. Several feature providers are involved in the system. The LLM, LangChain, enriches the context from data and sits on the model serving layer.

Support for Open Source Models

The Lakehouse model supports open-source models, allowing for swapping between CPU, GPU, and models like MPT.

Applications

Among the applications discussed were form filling, code generation, and fine-tuning.

Federation​

Data Streaming and Computation​

Composability​

Lakehouse Layers​

Examples of Lakehouse Layers​

Support for Open Source Models​

Applications​

이 문서를 언급한 문서들