Skip to content

[PRE REVIEW]: HeXtractor: Extracting Heterogeneous Graphs from Structured and Textual Data for Graph Neural Networks #7966

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
editorialbot opened this issue Mar 31, 2025 · 28 comments
Assignees
Labels
pre-review Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning

Comments

@editorialbot
Copy link
Collaborator

editorialbot commented Mar 31, 2025

Submitting author: @maddataanalyst (Filip Wójcik)
Repository: https://github.com/maddataanalyst/hextractor
Branch with paper.md (empty if default branch): paper
Version: v1.0.1
Editor: @Nikoleta-v3
Reviewers: @jboynyc, @cjbarrie
Managing EiC: Chris Vernon

Status

status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/2e206399373011922a60b595de64df77"><img src="https://joss.theoj.org/papers/2e206399373011922a60b595de64df77/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/2e206399373011922a60b595de64df77/status.svg)](https://joss.theoj.org/papers/2e206399373011922a60b595de64df77)

Author instructions

Thanks for submitting your paper to JOSS @maddataanalyst. Currently, there isn't a JOSS editor assigned to your paper.

@maddataanalyst if you have any suggestions for potential reviewers then please mention them here in this thread (without tagging them with an @). You can search the list of people that have already agreed to review and may be suitable for this submission.

Editor instructions

The JOSS submission bot @editorialbot is here to help you find and assign reviewers and start the main review. To find out what @editorialbot can do for you type:

@editorialbot commands
@editorialbot editorialbot added pre-review Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning labels Mar 31, 2025
@editorialbot
Copy link
Collaborator Author

Hello human, I'm @editorialbot, a robot that can help you with some common editorial tasks.

For a list of things I can do to help you, just type:

@editorialbot commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@editorialbot generate pdf

@editorialbot
Copy link
Collaborator Author

Software report:

github.com/AlDanial/cloc v 1.98  T=0.04 s (1060.1 files/s, 188778.3 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
Python                          15            453            730           1816
HTML                             3            228              6            933
Markdown                        10            142              0            362
Jupyter Notebook                 3              0           1505            343
TeX                              1              2              0            221
YAML                             6             13             12            123
TOML                             1              9              0             47
-------------------------------------------------------------------------------
SUM:                            39            847           2253           3845
-------------------------------------------------------------------------------

Commit count by author:

    16	Filip Wójcik
     6	Filip
     6	maddataanalyst
     5	Filip Wójcik, PhD
     2	Marcin Malczewski
     1	Filip W.

@editorialbot
Copy link
Collaborator Author

Paper file info:

📄 Wordcount for paper.md is 1030

✅ The paper includes a Statement of need section

@editorialbot
Copy link
Collaborator Author

License info:

✅ License found: BSD 3-Clause "New" or "Revised" License (Valid open source OSI approved license)

@editorialbot
Copy link
Collaborator Author

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

✅ OK DOIs

- 10.1007/978-981-16-6054-2_16 is OK
- 10.1109/TKDE.2020.3045924 is OK
- 10.1145/3308558.3313562 is OK
- 10.1145/3447548.3467350 is OK
- 10.1109/TBDATA.2022.3177455 is OK
- 10.5281/zenodo.3828935 is OK
- 10.1145/3620665.3640366 is OK
- 10.1109/ACCESS.2022.3174197 is OK
- 10.1145/3535101 is OK
- 10.1038/s41586-021-03819-2 is OK

🟡 SKIP DOIs

- No DOI given, and none found for title: Heterogeneous graph transformer
- No DOI given, and none found for title: Knowledge Graphs
- No DOI given, and none found for title: Heterogeneous graph neural networks
- No DOI given, and none found for title: Knowledge graphs: Fundamentals, techniques, and ap...
- No DOI given, and none found for title: Fast Graph Representation Learning with PyTorch Ge...
- No DOI given, and none found for title: Neural message passing for quantum chemistry
- No DOI given, and none found for title: Deep Graph Library: A Graph-Centric, Highly-Perfor...
- No DOI given, and none found for title: Industry-scale Knowledge Graphs: Lessons and Chall...
- No DOI given, and none found for title: How LinkedIn economic graph bonds information and ...
- No DOI given, and none found for title: Finding Money Launderers Using Heterogeneous Graph...

❌ MISSING DOIs

- 10.25080/tcwv9851 may be a valid DOI for title: Exploring Network Structure, Dynamics, and Functio...
- 10.15611/eada.2024.2.03 may be a valid DOI for title: An Analysis of Novel Money Laundering Data Using H...
- 10.1080/00207543.2023.2257807 may be a valid DOI for title: Knowledge graph driven credit risk assessment for ...
- 10.1080/17460441.2021.1910673 may be a valid DOI for title: Knowledge graphs and their applications in drug di...

❌ INVALID DOIs

- None

@editorialbot
Copy link
Collaborator Author

👉📄 Download article proof 📄 View article proof on GitHub 📄 👈

@editorialbot
Copy link
Collaborator Author

Five most similar historical JOSS papers:

HyperNetX: A Python package for modeling complex network data as hypergraphs
Submitting author: @brendapraggastis
Handling editor: @danielskatz (Active)
Reviewers: @szhorvat, @IvanIsCoding, @drj11
Similarity score: 0.7094

GraphNeT: Graph neural networks for neutrino telescope event reconstruction
Submitting author: @asogaard
Handling editor: @dfm (Active)
Reviewers: @JostMigenda, @GageDeZoort
Similarity score: 0.6861

PyDGN: a Python Library for Flexible and Reproducible Research on Deep Learning for Graphs
Submitting author: @diningphil
Handling editor: @arfon (Active)
Reviewers: @idoby, @sepandhaghighi
Similarity score: 0.6825

Raphtory: The temporal graph engine for Rust and Python
Submitting author: @narnolddd
Handling editor: @luizirber (Active)
Reviewers: @abhishektiwari, @arashbm
Similarity score: 0.6815

textnets: A Python package for text analysis with networks
Submitting author: @jboynyc
Handling editor: @gkthiruvathukal (Active)
Reviewers: @sara-02, @tresoldi
Similarity score: 0.6655

⚠️ Note to editors: If these papers look like they might be a good match, click through to the review issue for that paper and invite one or more of the authors before considering asking the reviewers of these papers to review again for JOSS.

@Nikoleta-v3
Copy link

👋🏻 @crvernon I’d be happy to take on this submission!

@crvernon
Copy link

@editorialbot assign @Nikoleta-v3 as editor

Thanks @Nikoleta-v3!

@editorialbot
Copy link
Collaborator Author

Assigned! @Nikoleta-v3 is now the editor

@Nikoleta-v3
Copy link

Hey @maddataanalyst 👋🏻 Thank you very much for the submission to JOSS! As you can see here:
#7966 (comment), several of the references are missing DOIs.

Could you please go over the ones without a DOI and check whether they actually have one? Once you've updated the .bib file and pushed the changes, you can comment on the issue with:

@editorialbot check references

Thank you!

@Nikoleta-v3
Copy link

Hey @jboynyc and @cjbarrie! 👋 Would you be willing to review this submission for JOSS?

As you already know we conduct our checklist-driven reviews here in GitHub issues and follow these guidelines: https://joss.readthedocs.io/en/latest/review_criteria.html 😄

The submission I'd like you to review is titled: "HeXtractor: Extracting Heterogeneous Graphs from Structured and Textual Data for Graph Neural Networks". You can find more details at the top of this GitHub issue 🆙

Please let me know if you're available 😄 Since the Easter break is coming up in some countries, we can delay the review deadline. Thank you!

@maddataanalyst
Copy link

Hey @maddataanalyst 👋🏻 Thank you very much for the submission to JOSS! As you can see here: #7966 (comment), several of the references are missing DOIs.

Could you please go over the ones without a DOI and check whether they actually have one? Once you've updated the .bib file and pushed the changes, you can comment on the issue with:

@editorialbot check references

Thank you!

Sure, not a problem at all. I will look for the missing DOIs and let you know. Best regards and thank you for starting the review.

@jboynyc
Copy link

jboynyc commented Apr 14, 2025

I'd be happy to. I may indeed need a delayed timeline given the upcoming holidays.

@maddataanalyst
Copy link

maddataanalyst commented Apr 14, 2025

Citations updated. Let's see what the Editorial Bot has to say.

@maddataanalyst
Copy link

@editorialbot check references

@editorialbot
Copy link
Collaborator Author

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

✅ OK DOIs

- 10.1007/978-981-16-6054-2_16 is OK
- 10.1145/3366423.3380027 is OK
- 10.1109/TKDE.2020.3045924 is OK
- 10.1145/3308558.3313562 is OK
- 10.1145/3447548.3467350 is OK
- 10.1109/TBDATA.2022.3177455 is OK
- 10.1145/3447772 is OK
- 10.1007/978-981-16-6054-2_16 is OK
- 10.25080/TCWV9851 is OK
- 10.5281/zenodo.3828935 is OK
- 10.15611/eada.2024.2.03 is OK
- 10.1145/3620665.3640366 is OK
- 10.48550/arXiv.1903.02428 is OK
- 10.48550/arXiv.1704.01212 is OK
- 10.48550/arXiv.1909.01315 is OK
- 10.1145/3329781.3332266 is OK
- 10.1145/3219819.3219921 is OK
- 10.1080/00207543.2023.2257807 is OK
- 10.1109/ACCESS.2022.3174197 is OK
- 10.1145/3535101 is OK
- 10.1038/s41586-021-03819-2 is OK
- 10.1080/17460441.2021.1910673 is OK
- 10.48550/arXiv.2307.13499 is OK

🟡 SKIP DOIs

- No DOI given, and none found for title: Knowledge graphs: Fundamentals, techniques, and ap...

❌ MISSING DOIs

- None

❌ INVALID DOIs

- None

@maddataanalyst
Copy link

Looks good now. For this the book by Mayank Kejriwal, Craig A. Knoblock, Pedro Szekely, "Knowledge Graphs: Fundamentals, Techniques, and Applications" MIT Press, 2021 - I cannot find the DOI anywhere, probably it was not assigned.

@Nikoleta-v3
Copy link

Amazing, thank you so much for taking care of the DOIs @maddataanalyst! 🙏🏻

@Nikoleta-v3
Copy link

@jboynyc Thank you so much for agreeing to review! 😄

I may indeed need a delayed timeline given the upcoming holidays.

Of course, not an issue at all! 🐇

@Nikoleta-v3
Copy link

@editorialbot add @jboynyc as reviewer

@editorialbot
Copy link
Collaborator Author

@jboynyc added to the reviewers list!

@cjbarrie
Copy link

If I am still needed, I can take a look!

@Nikoleta-v3
Copy link

Amazing! Thank you 😄 🙏🏻

@Nikoleta-v3
Copy link

@editorialbot add @cjbarrie as reviewer

@editorialbot
Copy link
Collaborator Author

@cjbarrie added to the reviewers list!

@Nikoleta-v3
Copy link

@editorialbot start review

@editorialbot
Copy link
Collaborator Author

OK, I've started the review over in #8057.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pre-review Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning
Projects
None yet
Development

No branches or pull requests

6 participants