Logging fix -- make root cause more clear if label job has misaligned dates (#611)
## Summary
Improve logging when the required join dates (`labelDs` minus window) do not overlap with the existing join data.
Add step-day functionality to labelJoin.
## Checklist
- [ ] Added Unit Tests
- [x] Covered by existing CI
- [ ] Integration tested
- [ ] Documentation update
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
- **New Features**
  - Enhanced flexibility in date handling for label join operations, allowing for an optional start date override.
  - Introduction of new JSON configuration files for the `quickstart.purchases.v1` dataset, facilitating structured data processing and analysis.
- **Bug Fixes**
  - Improved diagnostic messaging to clearly indicate when required data elements are missing during processing.
  - Enhanced error detection to ensure necessary computation windows are present before proceeding.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
---------
Co-authored-by: david-zlai <[email protected]>
Co-authored-by: ezvz <[email protected]>
          |Found existing partitions of join output: ${existingJoinPartitions.mkString(", ")}
          |
-         |Therefore unable to compute the labels for ${missingWindowToOutputs.keys.mkString(", ")}
+         |Required dates are computed based on label date (the run date) - window for distinct windows that are used in label parts.
          |
-         |For requested ds: $labelDs
+         |In this case, the run date is: $labelDs, and given the existing partitions we are unable to compute the labels for the following windows: ${missingWindowToOutputs.keys
+           .mkString(", ")} (days).
          |
-         |Proceeding with valid windows: ${computableWindowToOutputs.keys.mkString(", ")}
+         |""".stripMargin
+
+    // If there are no dates to run, also throw that error
+    require(
+      computableWindowToOutputs.nonEmpty,
+      s"""$baseLogString
+         |
+         |There are no partitions that we can run the label join for. At least one window must be computable.
          |
+         |Exiting.
          |""".stripMargin
     )

-    require(
-      computableWindowToOutputs.isEmpty,
-      "No valid windows to compute labels for given the existing join output range."+
-      s"Consider backfilling the join output table for the following days: ${missingWindowToOutputs.values.map(_.joinDsAsRange.start)}."
+    // Else log what we are running, but warn about missing windows
+    logger.warn(
+      s"""$baseLogString
+         |
+         |Proceeding with valid windows: ${computableWindowToOutputs.keys.mkString(", ")}
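
The check in this diff can be illustrated in isolation. The following is a minimal sketch, not the code from this commit: `LabelWindowCheck`, `requiredJoinDs`, `splitWindows`, and `checkWindows` are hypothetical names, windows are assumed to be whole days, and `System.err.println` stands in for `logger.warn`. It assumes that each window needs the join partition dated `labelDs` minus the window length, fails when no window is computable, and otherwise only warns about the missing ones.

```scala
import java.time.LocalDate

// Hypothetical, self-contained illustration; not the actual label join code.
object LabelWindowCheck {

  // Assumption: a window of N days needs the join partition dated labelDs - N days.
  def requiredJoinDs(labelDs: LocalDate, windowDays: Int): LocalDate =
    labelDs.minusDays(windowDays.toLong)

  // Returns (computable, missing) windows, preserving input order.
  def splitWindows(
      labelDs: LocalDate,
      windowsDays: Seq[Int],
      existingJoinPartitions: Set[LocalDate]
  ): (Seq[Int], Seq[Int]) =
    windowsDays.distinct.partition { w =>
      existingJoinPartitions.contains(requiredJoinDs(labelDs, w))
    }

  // Fails fast when nothing is computable; otherwise warns and returns the valid windows.
  def checkWindows(
      labelDs: LocalDate,
      windowsDays: Seq[Int],
      existingJoinPartitions: Set[LocalDate]
  ): Seq[Int] = {
    val (computable, missing) = splitWindows(labelDs, windowsDays, existingJoinPartitions)
    val partitions = existingJoinPartitions.map(_.toString).toSeq.sorted.mkString(", ")
    val baseLogString =
      s"""Found existing partitions of join output: $partitions
         |Run date: $labelDs. Missing windows: ${missing.mkString(", ")} (days).""".stripMargin

    // Throws IllegalArgumentException when no window can be computed.
    require(
      computable.nonEmpty,
      s"$baseLogString\nAt least one window must be computable. Exiting."
    )

    // Stand-in for logger.warn: report what runs and what is skipped.
    if (missing.nonEmpty) {
      System.err.println(
        s"$baseLogString\nProceeding with valid windows: ${computable.mkString(", ")}"
      )
    }
    computable
  }
}
```

Note the direction of the condition: `require` throws when its predicate is false, so the predicate must be `nonEmpty` for the job to stop only when nothing is computable. The removed `require(computableWindowToOutputs.isEmpty, ...)` had that condition inverted.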