Docker init #20
Conversation
Adds a README and keeps the Docker container active so the parquet table can be accessed.
Walkthrough

The changes introduce a new docker-init directory containing a Dockerfile, a compose.yaml, a README.md, a requirements.txt, a start.sh script, and a data-generation script, along with a new entry in .gitignore.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant User
    participant Docker
    participant DynamoDB
    participant Spark
    participant App
    User->>Docker: Run docker-compose up
    Docker->>DynamoDB: Start local instance
    Docker->>Spark: Start Spark Master
    Docker->>Spark: Start Spark Worker
    Docker->>App: Build application service
    App->>DynamoDB: Connect to database
    App->>Spark: Process data
```
📜 Recent review details

Configuration used: CodeRabbit UI

📒 Files selected for processing (1)

🚧 Files skipped from review as they are similar to previous changes (1)
Actionable comments posted: 16
🧹 Outside diff range and nitpick comments (8)
docker-init/requirements.txt (1)
3-3: Add a comment for the chronon-ai package

The chronon-ai package has a very low version number (0.0.82), which might indicate it's in early development. Consider adding a comment to explain the purpose of this package and its relationship to the project. For example:

```diff
-chronon-ai==0.0.82
+chronon-ai==0.0.82  # Core project package - https://github.com/your-org/chronon-ai
```

This will help other developers understand the significance of this package in the project.
docker-init/README.md (4)
1-3: Correct spelling and consider adding more context

The introduction provides a good overview of the setup. However, there are a couple of points to address:

- There's a spelling error in line 3: "anomolies" should be "anomalies".
- Consider adding a brief explanation of why this setup is useful or what it's intended for. This context can help users understand the purpose of the demo data and how it relates to the project.

Here's a suggested revision for line 3:

```diff
-This directory holds code to setup docker containers for dynamoDB, a spark master, and a spark worker. It also creates a container which contains a parquet table with example data containing anomolies. To start, run:
+This directory contains code to set up Docker containers for DynamoDB, a Spark master, and a Spark worker. It also creates a container with a parquet table containing example data with anomalies. This setup is useful for [brief explanation of purpose]. To start, run:
```
5-6: Add information about prerequisites and potential issues

The setup instruction is clear, but it would be helpful to provide more information to ensure a smooth setup process. Consider adding:

- Prerequisites (e.g., Docker and Docker Compose installation)
- Any specific version requirements
- Potential issues users might encounter and how to resolve them

Here's a suggested expansion of the setup instructions:

## Setup

### Prerequisites

- Docker (version X.X or higher)
- Docker Compose (version X.X or higher)

### Instructions

1. Ensure Docker daemon is running
2. Navigate to this directory in your terminal
3. Run the following command: `docker-compose up`

### Troubleshooting

- If you encounter [specific issue], try [solution]
- For more information, refer to [link to more detailed documentation if available]
7-11: Provide more detailed information about accessing and using the parquet file

The instructions for accessing the parquet table are clear, but they could be more comprehensive. Consider adding the following details:

- The exact location of the data.parquet file within the container
- Basic instructions on how to view or analyze the parquet file (e.g., using Python with pandas or PySpark)
- A brief description of what data the parquet file contains and how it relates to the anomalies mentioned earlier

Here's a suggested expansion of the instructions:

## Accessing the Parquet Table

1. Open a new terminal window
2. Run the following command to access the container: `docker-compose exec app bash`
3. Once inside the container, the parquet file is available at `/path/to/data.parquet`
4. To view or analyze the data, you can use Python with pandas or PySpark. For example:

```python
import pandas as pd
df = pd.read_parquet('/path/to/data.parquet')
print(df.head())
```
The data.parquet file contains [brief description of the data and its structure]. The anomalies in the data are [brief explanation of what constitutes an anomaly in this context].

1-11: Overall assessment: Good start, but needs more detail

The README provides a good introduction to setting up and accessing the demo data using Docker containers. However, to make it more comprehensive and user-friendly, consider implementing the suggestions in the previous comments:

1. Correct the spelling error and add more context about the purpose of this setup.
2. Expand the setup instructions with prerequisites and troubleshooting information.
3. Provide more detailed information about accessing and using the parquet file.

Additionally, consider adding sections on:

- The structure of the Docker setup (e.g., how DynamoDB, Spark master, and Spark worker interact)
- Any limitations or considerations users should be aware of
- Next steps or links to further documentation

These additions will greatly improve the usefulness of this README for users of various experience levels.

docker-init/Dockerfile (1)

1-20: Summary of Dockerfile review

Overall, the Dockerfile successfully sets up a multi-stage build combining Python and Java environments. However, there are several areas for potential improvement:

1. Update the base images to more recent versions of Python and OpenJDK.
2. Optimize the multi-stage build to copy only necessary files.
3. Review the contents of requirements.txt to ensure all dependencies are appropriate.
4. Be cautious with AWS credentials in environment variables and consider more secure alternatives.
5. Reconsider the approach of generating data during build time vs. at container startup.
6. Evaluate if bash is the most appropriate default command for this container.

These changes will help improve the security, efficiency, and flexibility of your Docker setup. Consider breaking this Dockerfile into separate services (one for Python, one for Java) if they don't need to be tightly coupled. This could simplify your setup and make it easier to manage and scale each part independently.

docker-init/compose.yaml (1)

48-48: Add a newline at the end of the file.

To adhere to YAML best practices and improve compatibility with various tools, add a newline at the end of the file. Simply add an empty line at the end of the file.

🧰 Tools
yamllint
[error] 48-48: no new line character at the end of file (new-line-at-end-of-file)

docker-init/generate_anomalous_data.py (1)

265-266: Update the print statement to reflect the correct data type

The print statement mentions "user data," but the script generates fraud data. For clarity, consider updating the message. Apply this diff to update the print statement:

```diff
-print("Successfully wrote user data to parquet")
+print("Successfully wrote fraud data to parquet")
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (6)
- .gitignore (1 hunks)
- docker-init/Dockerfile (1 hunks)
- docker-init/README.md (1 hunks)
- docker-init/compose.yaml (1 hunks)
- docker-init/generate_anomalous_data.py (1 hunks)
- docker-init/requirements.txt (1 hunks)
🧰 Additional context used
yamllint
docker-init/compose.yaml
[error] 48-48: no new line character at the end of file
(new-line-at-end-of-file)
Ruff
docker-init/generate_anomalous_data.py
1-1: decimal.Decimal imported but unused

Remove unused import: decimal.Decimal (F401)

6-6: pyspark.sql.types.FloatType imported but unused

Remove unused import (F401)

6-6: pyspark.sql.types.DecimalType imported but unused

Remove unused import (F401)

7-7: pandas imported but unused

Remove unused import: pandas (F401)

151-151: Comparison to None should be cond is None

Replace with cond is None (E711)

165-165: Comparison to None should be cond is None

Replace with cond is None (E711)
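As context for the two E711 hits, a small sketch of the suggested fix; only the cond name comes from the lint output above, and the surrounding function is hypothetical:

```python
# E711: compare to None by identity, not equality.
def apply_filter(value, cond=None):
    if cond is None:  # idiomatic fix for: if cond == None
        return value
    return cond(value)

print(apply_filter(3))                   # 3
print(apply_filter(3, lambda x: x * 2))  # 6

# Caveat worth checking before applying the fix in generate_anomalous_data.py:
# a PySpark Column overloads `==`, so for Column null checks the Spark idiom
# is col.isNull(); replacing `== None` with `is None` there would change behavior.
```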
🔇 Additional comments not posted (5)
docker-init/requirements.txt (1)
1-7: LGTM: Well-structured requirements file

The requirements.txt file is well-structured with pinned versions for reproducibility. This is good practice for maintaining consistent development and deployment environments.

docker-init/Dockerfile (2)
11-16: Be cautious with AWS credentials in environment variables

Copying the generate_anomalous_data.py script into the image is fine. Using environment variables for configuration is a good practice. However, be cautious about setting AWS credentials as environment variables in the Dockerfile, even if they are placeholders.

For security reasons, consider using a secrets management solution or injecting these values at runtime instead of build time. Ensure that the placeholder values are never used in a production environment.
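As a minimal sketch of runtime injection (the .env file name, its contents, and the image tag below are illustrative assumptions, not part of this PR):

```shell
# .env: keep this file out of version control (e.g., list it in .gitignore)
#   AWS_ACCESS_KEY_ID=...
#   AWS_SECRET_ACCESS_KEY=...

# Inject the values when the container starts instead of baking them into the image:
docker run --env-file .env my-app-image
```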
To verify the contents of the script and ensure it doesn't contain any sensitive information, you can use:
```shell
cat generate_anomalous_data.py
```
7-9: Dependency management approach looks good

The use of a requirements.txt file for managing Python dependencies is a good practice. Upgrading pip before installing dependencies is also beneficial.

To ensure all necessary dependencies are included and no unnecessary ones are present, please review the contents of the requirements.txt file. You can use the following command to display its contents:

✅ Verification successful
Dependency management approach verified
The docker-init/requirements.txt file contains the necessary and appropriately versioned Python dependencies. The approach of upgrading pip and installing dependencies is correctly implemented.

🏁 Scripts executed
The following scripts were executed for the analysis:
Script:

```shell
cat requirements.txt
```

Length of output: 91

Script:

```shell
#!/bin/bash
# Search for requirements.txt in the repository
fd requirements.txt
```

Length of output: 107

Script:

```shell
#!/bin/bash
# Display contents of docker-init/requirements.txt
cat docker-init/requirements.txt
```

Length of output: 131
.gitignore (1)
12-12: LGTM! Good practice to ignore .db files.

Adding *.db to .gitignore is a good practice, especially when working with local databases like DynamoDB. This prevents accidental commits of potentially large or sensitive database files.

Let's verify if there are any existing .db files that are already tracked. Also, as the PR mentions documentation updates, let's check for any new or modified documentation files:
✅ Verification successful
Verified! The .gitignore addition of *.db successfully prevents tracking of .db files, and the documentation has been appropriately updated.

🏁 Scripts executed
The following scripts were executed for the analysis:
Script:

```shell
#!/bin/bash
# Check for any tracked .db files
git ls-files '*.db'
```

Length of output: 21

Script:

```shell
#!/bin/bash
# Check for new or modified documentation files
git diff --name-only origin/main | grep -iE 'readme|\.md$|docs/'
```

Length of output: 88
docker-init/compose.yaml (1)
3-12: DynamoDB configuration looks good, but consider security implications.

The DynamoDB local configuration is well-structured and includes data persistence. However, running the container as root (line 12) could be a security risk in production environments.

Consider running the container with a non-root user. You can verify if this is possible by checking the Dockerfile of the DynamoDB local image:
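One possible check, sketched here under the assumption that the service uses the standard amazon/dynamodb-local image:

```shell
# Print the user the image is configured to run as;
# empty output means it defaults to root.
docker image inspect --format '{{.Config.User}}' amazon/dynamodb-local
```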
If the output is an empty string or "root", consider creating a custom Dockerfile to set a non-root user.
Actionable comments posted: 3
🧹 Outside diff range and nitpick comments (1)
docker-init/compose.yaml (1)
48-48: Add newline at the end of the file.

The YAML file is missing a newline character at the end. While this doesn't affect functionality, it's a best practice to include it for better compatibility with various tools and systems.
Add a newline character at the end of the file.
🧰 Tools
yamllint
[error] 48-48: no new line character at the end of file
(new-line-at-end-of-file)
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (4)
- docker-init/Dockerfile (1 hunks)
- docker-init/compose.yaml (1 hunks)
- docker-init/generate_anomalous_data.py (1 hunks)
- docker-init/requirements.txt (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (3)
- docker-init/Dockerfile
- docker-init/generate_anomalous_data.py
- docker-init/requirements.txt
🧰 Additional context used
yamllint
docker-init/compose.yaml
[error] 48-48: no new line character at the end of file
(new-line-at-end-of-file)
🔇 Additional comments not posted (1)
docker-init/compose.yaml (1)
41-48: ⚠️ Potential issue: Refine application service configuration for clarity and security.

The application service configuration needs improvement in several areas:

- The command tail -F nothing is a placeholder and doesn't start your application.
- While using fake AWS credentials is acceptable for local development, it's crucial to manage these carefully.
- There's no clear indication that these are development-only settings.
Consider the following improvements:
```yaml
app:
  build: .
  command: python your_app.py  # Replace with your actual start command
  environment:
    - DYNAMO_ENDPOINT=http://dynamo:8000
    # Development-only credentials. Do not use in production.
    - AWS_DEFAULT_REGION=${AWS_DEFAULT_REGION:-us-west-2}
    - AWS_ACCESS_KEY_ID=${AWS_ACCESS_KEY_ID:-fakeaccesskey}
    - AWS_SECRET_ACCESS_KEY=${AWS_SECRET_ACCESS_KEY:-fakesecretkey}
```

This allows you to override these values using environment variables when needed and clearly indicates that these are development settings.
To ensure no hardcoded credentials are used elsewhere in the project, run:
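A scan along these lines (matching the one recorded verbatim in this review's executed scripts further below) would be:

```shell
#!/bin/bash
# Search for references to AWS credential environment variables in the codebase
rg -i 'AWS_ACCESS_KEY_ID|AWS_SECRET_ACCESS_KEY' --context 2
```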
🧰 Tools
yamllint
[error] 48-48: no new line character at the end of file
(new-line-at-end-of-file)
Actionable comments posted: 1
🧹 Outside diff range and nitpick comments (2)
docker-init/start.sh (2)
3-3: LGTM: Correct use of exec, consider handling no-argument case.

The use of exec "$@" is appropriate for a container entrypoint script, allowing flexible command execution.

Consider adding a default behavior when no arguments are provided:

```diff
-exec "$@"
+if [ $# -eq 0 ]; then
+  echo "No command provided. Exiting."
+  exit 0
+else
+  exec "$@"
+fi
```

This change ensures the script has a defined behavior even when run without arguments.
1-3: Consider adding a comment block for documentation.

While the script is relatively simple, it would benefit from a brief comment block explaining its purpose and usage. This aligns with the PR objective of updating documentation.

Consider adding a comment block at the beginning of the file:

```diff
 #!/bin/bash
+# This script is part of the Docker initialization process.
+# It generates anomalous data using a Python script and then
+# executes any additional commands passed as arguments.
+#
+# Usage: ./start.sh [additional commands]
+
 python3 generate_anomalous_data.py
 exec "$@"
```

This addition provides context for future maintainers and users of the script.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (2)
- docker-init/Dockerfile (1 hunks)
- docker-init/start.sh (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
- docker-init/Dockerfile
🔇 Additional comments not posted (1)
docker-init/start.sh (1)
1-1: LGTM: Correct shebang for a Bash script.

The shebang #!/bin/bash is appropriate for a Bash script and ensures the script will be executed with Bash.
Actionable comments posted: 1
🧹 Outside diff range and nitpick comments (2)
docker-init/Dockerfile (2)
3-6: Consider combining RUN commands and fixing formatting

The Python setup looks good, but there are a couple of minor improvements we can make:

- Remove the extra space before pip3 install on line 6.
- Consider combining the yum install and pip install commands into a single RUN instruction to reduce the number of layers in the image.

Here's a suggested improvement:

```diff
-RUN yum install -y python3
-RUN pip3 install --upgrade pip;  pip3 install -r requirements.txt
+RUN yum install -y python3 && \
+    pip3 install --upgrade pip && \
+    pip3 install -r requirements.txt
```

This change will create fewer layers in the Docker image, potentially reducing its size.
17-17: LGTM: Appropriate use of ENTRYPOINT

Setting the ENTRYPOINT to the start.sh script is a good practice. It allows for additional setup or configuration steps before running the main application.
To improve maintainability, consider adding a comment in the Dockerfile or separate documentation explaining the purpose and contents of the start.sh script. This will help other developers understand the container's startup process.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (2)
- docker-init/Dockerfile (1 hunks)
- docker-init/requirements.txt (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
- docker-init/requirements.txt
🔇 Additional comments (2)
docker-init/Dockerfile (2)
1-2: LGTM: Appropriate base image selection

The use of amazoncorretto:17 as the base image is a good choice, aligning with AWS EMR standards and providing Java 17 support.
12-15: Ensure secure handling of sensitive environment variables

While setting placeholder values for AWS credentials in the Dockerfile is common during development, it's crucial to handle these securely in production environments.
For production use, consider the following best practices:
- Use environment variable injection at runtime instead of hardcoding values in the Dockerfile.
- Utilize AWS IAM roles for EC2 instances or ECS tasks when deploying to AWS, eliminating the need for hardcoded credentials.
- For local development, use a .env file in conjunction with docker-compose to manage these variables.
To verify the current usage of these environment variables, you can run:
Ensure that the actual secret values are never committed to version control or baked into the Docker image.
✅ Verification successful
To ensure that your AWS credentials are handled securely across all environments, please run the following script to check how these environment variables are utilized in the codebase:
```shell
#!/bin/bash
# Search for references to AWS credential environment variables in the codebase
rg -i 'AWS_ACCESS_KEY_ID|AWS_SECRET_ACCESS_KEY' --context 2
```
Please run the following corrected script to verify how the AWS credential environment variables are used in the codebase:
AWS credentials are securely managed with placeholder values confined to configuration files
The AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY are set with placeholder values in both docker-init/Dockerfile and docker-init/compose.yaml. There are no additional references to these credentials elsewhere in the codebase, ensuring that sensitive information is not exposed or mismanaged.

🏁 Scripts executed
The following scripts were executed for the analysis:
Script:

```shell
#!/bin/bash
# Search for usage of AWS credential environment variables
rg -i 'AWS_ACCESS_KEY_ID|AWS_SECRET_ACCESS_KEY'
```

Length of output: 309
Script:
Length of output: 98
Script:

```shell
#!/bin/bash
# Search for references to AWS credential environment variables in the codebase
rg -i 'AWS_ACCESS_KEY_ID|AWS_SECRET_ACCESS_KEY' --context 2
```

Length of output: 653
@chewy-zlai - couple of minor comments. Looks largely good though
…docker is restarted instead of recreated).
Docker init for POC
Summary
This creates a compose.yaml file which can be used to bring up a DynamoDB local instance, a Spark master, and a Spark worker. It also creates a container which holds a parquet table with example data including some anomalies.
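As a rough sketch of the shape such a compose file might take (image names, ports, and Spark settings here are illustrative assumptions; only the keep-alive command and the DynamoDB endpoint mirror details mentioned in the review above):

```yaml
# Hypothetical sketch; the actual compose.yaml in this PR may differ.
services:
  dynamo:
    image: amazon/dynamodb-local      # assumed image
    ports:
      - "8000:8000"                   # matches DYNAMO_ENDPOINT=http://dynamo:8000 above
  spark-master:
    image: bitnami/spark              # assumed image
    environment:
      - SPARK_MODE=master
  spark-worker:
    image: bitnami/spark
    environment:
      - SPARK_MODE=worker
      - SPARK_MASTER_URL=spark://spark-master:7077
    depends_on:
      - spark-master
  app:
    build: .
    command: tail -F nothing          # placeholder that keeps the container alive
    depends_on:
      - dynamo
```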
Checklist
Summary by CodeRabbit

New Features
- Introduced a README.md file with instructions for initializing demo data using Docker.

Bug Fixes
- Updated .gitignore to exclude .db files from version control.

Documentation
- Added a README.md for setting up the development environment.

Chores
- Added a requirements.txt file listing essential dependencies for the project.
- Added a start.sh script for executing data generation processes.