6.0.4 #14615
DevinTDHa announced in Announcement
📢 Spark NLP 6.0.4: MiniLMEmbeddings, DataFrame Optimization, and Enhanced PDF Processing
We are excited to announce the release of Spark NLP 6.0.4! This version brings advancements in text embeddings with the introduction of the MiniLM family, Spark DataFrame optimizations, and enhanced PDF document parsing. Upgrade to 6.0.4 to leverage these cutting-edge features and expand your NLP capabilities at scale.
Stay updated with our latest examples and tutorials by visiting our Medium - Spark NLP blog!
🔥 Highlights
🚀 New Features & Enhancements
Advanced Text Embeddings
This release introduces a new family of efficient text embedding models through the MiniLMEmbeddings annotator, which enables MiniLM models to generate sentence embeddings. These models are designed to deliver strong performance while being significantly smaller and faster than larger alternatives, which suits NLP tasks that need compact text representations. (Link to notebook)
Spark DataFrame Optimization
Enhanced PDF Document Processing
The PDF Reader and PdfToText transformer have been significantly improved for more comprehensive and fault-tolerant document parsing. (Link to notebook)
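As a rough illustration of the MiniLMEmbeddings annotator named under Advanced Text Embeddings, the sketch below wires it into a two-stage pipeline. It assumes the Python class follows the same pattern as other Spark NLP sentence-embedding annotators (`pretrained()`, `setInputCols`, `setOutputCol`) and that `pretrained()` with no arguments resolves to a default MiniLM model; the helper name `build_minilm_pipeline` and the column names are hypothetical and not part of the release.

```python
def build_minilm_pipeline(input_col="text"):
    """Assemble an unfitted DocumentAssembler -> MiniLMEmbeddings pipeline.

    Hypothetical helper for illustration; requires an active Spark NLP session.
    """
    # Imported lazily so this function can be defined where Spark NLP is absent.
    from pyspark.ml import Pipeline
    from sparknlp.base import DocumentAssembler
    from sparknlp.annotator import MiniLMEmbeddings

    # Turn the raw text column into Spark NLP "document" annotations.
    document_assembler = (
        DocumentAssembler()
        .setInputCol(input_col)
        .setOutputCol("document")
    )

    # Assumption: pretrained() without arguments loads a default MiniLM model.
    embeddings = (
        MiniLMEmbeddings.pretrained()
        .setInputCols(["document"])
        .setOutputCol("embeddings")
    )

    return Pipeline(stages=[document_assembler, embeddings])
```

With a session from `sparknlp.start()` and a DataFrame `df` holding a `text` column, something like `build_minilm_pipeline().fit(df).transform(df)` would add an `embeddings` column. The linked notebook remains the authoritative usage reference.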
❤️ Community Support
⚙️ Installation
Python
# PyPI
pip install spark-nlp==6.0.4
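After installing the Python package, a session can be started from Python. The following is a minimal sketch built on `sparknlp.start()`, whose `gpu`, `apple_silicon`, and `aarch64` flags correspond to the package variants listed in this section. The wrapper name `start_spark_nlp` is hypothetical.

```python
def start_spark_nlp(gpu=False, apple_silicon=False, aarch64=False):
    """Start a Spark session using the Spark NLP build for the given hardware."""
    # Imported lazily so this helper can be defined before spark-nlp is installed.
    import sparknlp

    # Set at most one flag; with none set, the default CPU package is used.
    spark = sparknlp.start(gpu=gpu, apple_silicon=apple_silicon, aarch64=aarch64)

    # sparknlp.version() reports the installed Python package version.
    print("Spark NLP version:", sparknlp.version())
    return spark
```

Calling `start_spark_nlp()` on a machine with `spark-nlp==6.0.4` installed should report that version. Pass `gpu=True`, `apple_silicon=True`, or `aarch64=True` to select the matching build.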
Spark Packages
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x, and 3.4.x (Scala 2.12):
GPU
Apple Silicon
AArch64
Maven
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x, and 3.4.x:
spark-nlp-gpu:
spark-nlp-silicon:
spark-nlp-aarch64:
FAT JARs
What's Changed
Full Changelog: 6.0.3...6.0.4
This discussion was created from the release 6.0.4.