Replies: 1 comment
-
HI @Jay-ju , thanks for writing this up. I think this approach makes sense. As a first step, could you share some sample workloads that would benefit from this? I'd like to set up some baseline benchmarks that we can work towards improving. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Technical Proposal for Daft-Lance Integration
Current Implementation
Daft effectively utilizes URL-based lazy processing to perform joins, aggregations, and complex operations (e.g., window functions) in multi-modal workflows. This paradigm delays downloading content until its actual use, which offers significant flexibility.
Identified Storage-Side Challenges
Proposed Integration Approach
To harness the complementary strengths of Daft and Lance, we suggest:
Replace URL-based download interfaces with row_id-based point queries via lance_take(),
Substitute upload interfaces with Lance's atomic update() operations.
Anticipated Advantages
@jaychia @kevinzwang @universalmind303 WDYT?
Beta Was this translation helpful? Give feedback.
All reactions