Disable Per-SQL summary text output #1530
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Signed-off-by: Ahmed Hussein (amahussein) [email protected]
Fixes #1527
Disable the text format output generated per-sql. The target is to reduce the noise of the stdout and improve the performance of the core-tools
Impact on the output:
rapids_4_spark_qualification_output_persql.log
is not generated anymore by the qualToolAppName
fromrapids_4_spark_qualification_output_persql.csv
rapids_4_spark_qualification_output_persql.log
can still be generated by the RunningQualificationAppImpact on Performance and usability:
Buffer<String, Int>
Details
This pull request to
core/src/main/scala/com/nvidia/spark/rapids/tool/qualification/QualOutputWriter.scala
includes changes to improve the performance and structure of the SQL CSV report generation and related functionality. The most important changes include the removal of the application name from the per-SQL report, the optimization of string concatenation and object allocation, and updates to the test expectations to reflect these changes.Improvements to report generation:
core/src/main/scala/com/nvidia/spark/rapids/tool/qualification/QualOutputWriter.scala
: Added a new methodwritePerSqlCSVReport
to generate the per-SQL CSV report with optimized string concatenation and reduced object allocations. Removed the application name from the report and updated the header construction accordingly.Changes to related classes:
core/src/main/scala/com/nvidia/spark/rapids/tool/qualification/Qualification.scala
: Removed the call towritePerSqlTextReport
and updated the call towritePerSqlCSVReport
to reflect the new method signature.core/src/main/scala/com/nvidia/spark/rapids/tool/qualification/RunningQualOutputWriter.scala
: Updated the header construction to remove the application name size.core/src/main/scala/com/nvidia/spark/rapids/tool/qualification/RunningQualificationApp.scala
: Simplified the header construction by removing the application name size.Updates to test expectations