[PRE REVIEW]: BERTeley: A python package for topic modeling on scientific articles #6052

Closed
editorialbot opened this issue Nov 11, 2023 · 18 comments
Labels
- paused
- pre-review
- Python
- query-scope: Submissions of uncertain scope for JOSS
- rejected
- TeX
- Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning

Comments

@editorialbot
Collaborator

Submitting author: @ericchagnon15 (Eric Chagnon)
Repository: https://github.com/lbl-camera/berteley
Branch with paper.md (empty if default branch): paper
Version: 0.0.15
Editor: Pending
Reviewers: Pending
Managing EiC: Arfon Smith

Status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257"><img src="https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257/status.svg)](https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257)

Author instructions

Thanks for submitting your paper to JOSS @ericchagnon15. Currently, there isn't a JOSS editor assigned to your paper.

@ericchagnon15 if you have any suggestions for potential reviewers then please mention them here in this thread (without tagging them with an @). You can search the list of people that have already agreed to review and may be suitable for this submission.

Editor instructions

The JOSS submission bot @editorialbot is here to help you find and assign reviewers and start the main review. To find out what @editorialbot can do for you type:

@editorialbot commands

editorialbot added the pre-review and Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning labels on Nov 11, 2023

@editorialbot
Collaborator Author

Hello human, I'm @editorialbot, a robot that can help you with some common editorial tasks.

For a list of things I can do to help you, just type:

@editorialbot commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@editorialbot generate pdf

@editorialbot
Collaborator Author

Software report:

github.com/AlDanial/cloc v 1.88  T=0.05 s (582.0 files/s, 92993.4 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
Python                          10            649           1429           1628
Markdown                         7             67              0            195
TeX                              1              8              0            135
YAML                             5             24             20            118
CSS                              2             19              2             77
HTML                             1              8              4             36
DOS Batch                        1              8              1             26
make                             1              4              7              9
-------------------------------------------------------------------------------
SUM:                            28            787           1463           2224
-------------------------------------------------------------------------------


gitinspector failed to run statistical information for the repository

@editorialbot
Collaborator Author

Wordcount for paper.md is 1134

@editorialbot
Collaborator Author

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

OK DOIs

- 10.18653/v1/2021.naacl-main.300 is OK
- 10.1145/2684822.2685324 is OK
- 10.18653/v1/2020.nlposs-1.19 is OK

MISSING DOIs

- 10.18653/v1/2020.acl-main.207 may be a valid DOI for title: SPECTER: Document-level Representation Learning using Citation-informed Transformers
- 10.18653/v1/2022.naacl-main.331 may be a valid DOI for title: Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity
- 10.18653/v1/d19-1410 may be a valid DOI for title: Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

INVALID DOIs

- None

@arfon
Member

arfon commented Nov 11, 2023

> A full length paper including experiments has been submitted to Elsevier NLP and is currently under review. This software provides support for conducting topic modeling on scientific articles.

@ericchagnon15 – thanks for your submission to JOSS. Could you say more about this paper? i.e., what is the title, what is the focus of the paper?

Also, could you clarify what license this software is released under? It looks like a BSD 3-clause but appears to have additional language?

@editorialbot
Collaborator Author

👉📄 Download article proof 📄 View article proof on GitHub 📄 👈

@arfon
Member

arfon commented Nov 24, 2023

Friendly reminder here @ericchagnon15

arfon added the paused label on Nov 24, 2023

@ericchagnon15

Sorry for the delayed response. Elsevier NLP recently accepted our paper titled "Benchmarking topic models on scientific articles using BERTeley" and it focuses on carrying out a benchmarking experiment on different language models for the task of topic modeling.

The license we have is the standard BSD 3-clause, and the additional language is a requirement from the intellectual property office and Lawrence Berkeley National Lab.

@arfon
Member

arfon commented Dec 9, 2023

> Sorry for the delayed response. Elsevier NLP recently accepted our paper titled "Benchmarking topic models on scientific articles using BERTeley" and it focuses on carrying out a benchmarking experiment on different language models for the task of topic modeling.

Is this something you could share with me to review? My concern here is possible duplicate publications for the same work so I'd like to see how much of the other paper is discussing this software. You can email me a draft at [email protected]

@arfon
Member

arfon commented Dec 11, 2023

> The license we have is the standard BSD 3-clause, and the additional language is a requirement from the intellectual property office and Lawrence Berkeley National Lab.

@ericchagnon15 – we'll need you to update this to a truly 'standard' (i.e., no modifications) license before proceeding with the review. This has happened for LBNL submissions before; see here for an example: #1370 (comment)

@ericchagnon15

> > Sorry for the delayed response. Elsevier NLP recently accepted our paper titled "Benchmarking topic models on scientific articles using BERTeley" and it focuses on carrying out a benchmarking experiment on different language models for the task of topic modeling.
>
> Is this something you could share with me to review? My concern here is possible duplicate publications for the same work so I'd like to see how much of the other paper is discussing this software. You can email me a draft at [email protected]

Here is the link to the preprint version.

https://www.sciencedirect.com/science/article/pii/S2949719123000419

@arfon
Member

arfon commented Dec 13, 2023

Thanks @ericchagnon15. I'm going to open a scope review here so other editors can weigh in but given phrases such as:

> This paper presents BERTeley, a Python package built upon BERTopic to address this unique challenge. Through modifications and enhancements to the BERTopic methodology, BERTeley provides a simple, yet powerful tool for topic modeling on a corpus consisting of scientific articles

...I'm inclined to say this isn't publishable here as the JOSS paper would count as a duplicate publication for the same work.

@arfon
Member

arfon commented Dec 13, 2023

@editorialbot query scope

@editorialbot
Collaborator Author

Submission flagged for editorial review.

editorialbot added the query-scope (Submissions of uncertain scope for JOSS) label on Dec 13, 2023

@arfon
Member

arfon commented Jan 7, 2024

@ericchagnon15 – apologies that it has taken a while for us to get back to you here. After consideration, the JOSS editorial team has determined that this article would represent a second publication of the work outlined in this other article (https://www.sciencedirect.com/science/article/pii/S2949719123000419), and as such it is not permitted here.

@arfon
Member

arfon commented Jan 7, 2024

@editorialbot reject

@editorialbot
Collaborator Author

Paper rejected.

@ericchagnon15

@arfon thanks for taking the time to deliberate and for your consideration!
