[RORDEV-1410] Data stream audit sink setup improvements #1089

mateuszkp96 · 2025-03-25T16:41:50Z

No description provided.

mateuszkp96 · 2025-03-25T18:04:55Z

I added changes for the es816x module. I will port to other modules when the solution is accepted

coutoPL · 2025-04-04T18:23:16Z

core/src/main/scala/tech/beshu/ror/accesscontrol/audit/sink/AuditDataStreamCreator.scala

 import tech.beshu.ror.implicits.*
 import tech.beshu.ror.utils.RefinedUtils.*

 import java.util.concurrent.TimeUnit

 final class AuditDataStreamCreator(services: NonEmptyList[DataStreamService]) extends Logging {

-  def createIfNotExists(dataStreamName: RorAuditDataStream): Task[Unit] = {
-    services.toList.traverse(createIfNotExists(_, dataStreamName)).map((_: List[Unit]) => ())
+  def createIfNotExists(dataStreamName: RorAuditDataStream): Task[Either[String, Unit]] = {


let's create at least type def for the String (aka Message or sth like that)

coutoPL · 2025-04-04T18:26:35Z

core/src/main/scala/tech/beshu/ror/accesscontrol/audit/sink/AuditDataStreamCreator.scala

+      .toList
+      .map(createIfNotExists(_, dataStreamName))
+      .sequence
+      .map(_.sequence.map((_: List[Unit]) => ()))


I don't see any monoid here, so I guess we miss potential error messages (when more than one service fails)

coutoPL · 2025-04-04T18:29:38Z

core/src/main/scala/tech/beshu/ror/accesscontrol/audit/sink/EsDataStreamBasedAuditSink.scala

-      .map((_: Unit) => new EsDataStreamBasedAuditSink(serializer, rorAuditDataStream, auditSinkService))
+      .flatMap {
+        case Right(()) => Task.delay(new EsDataStreamBasedAuditSink(serializer, rorAuditDataStream, auditSinkService))
+        case Left(errorMsg) => Task.raiseError(new IllegalStateException(errorMsg))


Is it an illegal state? I think it's possible case

coutoPL · 2025-04-04T18:32:54Z

core/src/main/scala/tech/beshu/ror/es/DataStreamService.scala

-      _ <- createDataStream(settings.dataStreamName)
-    } yield ()
-  }
+      _ <- createIfAbsent(


maybe we can format it like that:

_ <- createIfAbsent( checkIfResourceExists = checkIndexLifecyclePolicyExists(settings.lifecyclePolicy.id), createResource = createIndexLifecyclePolicy(settings.lifecyclePolicy), onNotAcknowledged = Failure(s"Unable to determine if the index lifecycle policy with ID '${settings.lifecyclePolicy.id.show}' has been created") )

coutoPL · 2025-04-04T18:38:11Z

core/src/main/scala/tech/beshu/ror/es/DataStreamService.scala


  def checkDataStreamExists(dataStreamName: DataStreamName.Full): Task[Boolean]

  protected def createDataStream(dataStreamName: DataStreamName.Full): Task[CreationResult]

+  protected def checkIndexLifecyclePolicyExists(policyId: NonEmptyString): Task[Boolean] = Task.pure(false)


why the default implementation? I'm not sure if we need it
(here, and below too)

coutoPL · 2025-04-04T18:47:57Z

core/src/test/scala/tech/beshu/ror/unit/es/DataStreamServiceTest.scala

+  "A ReadonlyREST data stream service" when {
+    "fully setup data stream called" should {
+      "not attempt to create data stream when one exists" in {
+        tryToCreateDataStream(DataStreamsMocks.alreadyExists)


The tests are great, but IMO we should do sth to improve the readability. Now, it's hard for the reader to grasp what each test does without looking into runTests, testSuccessfulDataStreamSetup, and other method implementations.

Maybe we should try to maintain the given-when-then style? And try to explicitly show what the mock returns, what is called, and how we assert the result.

Obviously, we will have to repeat ourselves many times, but maybe we will be able to achieve a state when the reader can read the test, and without looking into some private methods used in it, they will be able to understand what the test does and how.

WDYT?

coutoPL · 2025-04-04T18:53:34Z

es816x/src/main/scala/tech/beshu/ror/es/services/EsDataStreamService.scala

+          .asInstanceOf[java.util.List[Object]]
+          .asScala
+          .map { obj =>
+            val policy = ReflecUtils.invokeMethod(obj, obj.getClass, "getLifecyclePolicy")


Let's not use ReflecUtils. We should consider it as deprecated. Let's use org.joor.Reflect instead

coutoPL · 2025-04-04T18:55:32Z

es816x/src/main/scala/tech/beshu/ror/es/services/EsDataStreamService.scala

In this file, we use Instant.now. Let's inject a clock to the class and use it instead

coutoPL · 2025-04-04T18:58:53Z

es816x/src/main/scala/tech/beshu/ror/es/services/RestClientDataStreamService.scala

private def failure(message: String) = { Task.raiseError(new IllegalStateException(message)) }

it's not an invalid state, isn't it?

coutoPL · 2025-04-04T19:01:07Z

es816x/src/main/scala/tech/beshu/ror/es/services/RestClientDataStreamService.scala

+          val policies = response.entityJson.obj.keySet
+          Task.pure(policies.contains(policy))
+        case response =>
+          failure(s"Cannot get ILM policy [$policy] - response code: ${response.statusCode}")


It'd be nice to add the response body too.
I just wonder if we should add to the message or as a debug log.

Datastream serivce improvements

0699109

mateuszkp96 changed the base branch from develop to epic/RORDEV-1263 March 25, 2025 16:41

Revert changes

c3a9488

mateuszkp96 requested a review from coutoPL March 25, 2025 18:05

coutoPL reviewed Apr 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RORDEV-1410] Data stream audit sink setup improvements #1089

[RORDEV-1410] Data stream audit sink setup improvements #1089

mateuszkp96 commented Mar 25, 2025

mateuszkp96 commented Mar 25, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

coutoPL Apr 4, 2025

[RORDEV-1410] Data stream audit sink setup improvements #1089

Are you sure you want to change the base?

[RORDEV-1410] Data stream audit sink setup improvements #1089

Conversation

mateuszkp96 commented Mar 25, 2025

mateuszkp96 commented Mar 25, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment