Skip to content

Releases: databricks/megablocks

v0.3.2

10 Oct 22:32
Compare
Choose a tag to compare

What's Changed

  • Support for bfloat16
  • Optimizations for top_k > 1
  • Support for fully-sharded data parallelism
  • Support tensor model parallelism when expert_parallel_world_size > num_experts
  • Optimizations for activation memory
  • Support activation quantization (thanks @dblalock!)
  • Optimizations for SM90 (Hopper)
  • Lots of bug fixes, cleanup and small optimizations

New Contributors

Full Changelog: v0.1...v0.3.2

Version 0.1

01 May 15:14
Compare
Choose a tag to compare
Version 0.1 Pre-release
Pre-release

Initial release documenting repository state prior to MLSys'23 camera-ready publication.