Skip to content

Conversation

coyotte508
Copy link
Member

@coyotte508 coyotte508 commented Jul 16, 2025

cc @Kakulukian @assafvayner for viz, follow up to #1616

Based on https://github.com/huggingface/xet-core/blob/7e41fb0dd7cfb276222b9668d0b97a984647721e/spec/shard.md

Need to handle:

  • split into multiple shards when xorb or file info grows too big
  • uploading xorbs & shards (and we need to upload xorbs before shards referencing them)

@coyotte508
Copy link
Member Author

coyotte508 commented Aug 26, 2025

Merging, still experimental

Optimizations remaining:

@coyotte508 coyotte508 merged commit 754b069 into main Aug 26, 2025
3 of 6 checks passed
@coyotte508 coyotte508 deleted the shard-creation branch August 26, 2025 13:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants