[PRE REVIEW]: BERTeley: A python package for topic modeling on scientific articles #6052

editorialbot · 2023-11-11T03:42:40Z

Submitting author: @ericchagnon15 (Eric Chagnon)
Repository: https://github.com/lbl-camera/berteley
Branch with paper.md (empty if default branch): paper
Version: 0.0.15
Editor: Pending
Reviewers: Pending
Managing EiC: Arfon Smith

Status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257"><img src="https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257/status.svg)](https://joss.theoj.org/papers/d2faab45e877eb77b380ec7c73f63257)

Author instructions

Thanks for submitting your paper to JOSS @ericchagnon15. Currently, there isn't a JOSS editor assigned to your paper.

@ericchagnon15 if you have any suggestions for potential reviewers then please mention them here in this thread (without tagging them with an @). You can search the list of people that have already agreed to review and may be suitable for this submission.

Editor instructions

The JOSS submission bot @editorialbot is here to help you find and assign reviewers and start the main review. To find out what @editorialbot can do for you type:

@editorialbot commands

The text was updated successfully, but these errors were encountered:

editorialbot · 2023-11-11T03:42:42Z

Hello human, I'm @editorialbot, a robot that can help you with some common editorial tasks.

For a list of things I can do to help you, just type:

@editorialbot commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@editorialbot generate pdf

editorialbot · 2023-11-11T03:42:44Z

Software report:

github.com/AlDanial/cloc v 1.88  T=0.05 s (582.0 files/s, 92993.4 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
Python                          10            649           1429           1628
Markdown                         7             67              0            195
TeX                              1              8              0            135
YAML                             5             24             20            118
CSS                              2             19              2             77
HTML                             1              8              4             36
DOS Batch                        1              8              1             26
make                             1              4              7              9
-------------------------------------------------------------------------------
SUM:                            28            787           1463           2224
-------------------------------------------------------------------------------


gitinspector failed to run statistical information for the repository

editorialbot · 2023-11-11T03:42:46Z

Wordcount for paper.md is 1134

editorialbot · 2023-11-11T03:43:15Z

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

OK DOIs

- 10.18653/v1/2021.naacl-main.300 is OK
- 10.1145/2684822.2685324 is OK
- 10.18653/v1/2020.nlposs-1.19 is OK

MISSING DOIs

- 10.18653/v1/2020.acl-main.207 may be a valid DOI for title: SPECTER: Document-level Representation Learning using Citation-informed Transformers
- 10.18653/v1/2022.naacl-main.331 may be a valid DOI for title: Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity
- 10.18653/v1/d19-1410 may be a valid DOI for title: Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

INVALID DOIs

- None

arfon · 2023-11-11T03:43:26Z

A full length paper including experiments has been submitted to Elsevier NLP and is currently under review. This software provides support for conducting topic modeling on scientific articles.

@ericchagnon15 – thanks for your submission to JOSS. Could you say more about this paper? i.e., what is the title, what is the focus of the paper?

Also, could you clarify what license this software is released under? It looks like a BSD 3-clause but appears to have additional language?

editorialbot · 2023-11-11T03:43:55Z

👉📄 Download article proof 📄 View article proof on GitHub 📄 👈

arfon · 2023-11-24T09:06:23Z

Friendly reminder here @ericchagnon15

ericchagnon15 · 2023-12-04T17:54:40Z

Sorry for the delayed response. Elsevier NLP recently accepted our paper titled "Benchmarking topic models on scientific articles using BERTeley" and it focuses on carrying out a benchmarking experiment on different language models for the task of topic modeling.

The license we have is the standard BSD 3-clause, and the additional language is a requirement from the intellectual property office and Lawrence Berkeley National Lab.

arfon · 2023-12-09T19:31:23Z

Sorry for the delayed response. Elsevier NLP recently accepted our paper titled "Benchmarking topic models on scientific articles using BERTeley" and it focuses on carrying out a benchmarking experiment on different language models for the task of topic modeling.

Is this something you could share with me to review? My concern here is possible duplicate publications for the same work so I'd like to see how much of the other paper is discussing this software. You can email me a draft at [email protected]

arfon · 2023-12-11T15:24:25Z

The license we have is the standard BSD 3-clause, and the additional language is a requirement from the intellectual property office and Lawrence Berkeley National Lab.

@ericchagnon15 – we'll need you to update this to a truly 'standard' (i.e., no modifications) license before proceeding with the review. This has happened for LBNL submissions before see here for example: #1370 (comment)

ericchagnon15 · 2023-12-12T23:00:50Z

Sorry for the delayed response. Elsevier NLP recently accepted our paper titled "Benchmarking topic models on scientific articles using BERTeley" and it focuses on carrying out a benchmarking experiment on different language models for the task of topic modeling.

Is this something you could share with me to review? My concern here is possible duplicate publications for the same work so I'd like to see how much of the other paper is discussing this software. You can email me a draft at [email protected]

Here is the link to the preprint version.

https://www.sciencedirect.com/science/article/pii/S2949719123000419

arfon · 2023-12-13T08:32:26Z

Thanks @ericchagnon15. I'm going to open a scope review here so other editors can weigh in but given phrases such as:

This paper presents BERTeley, a Python package built upon BERTopic to address This unique challenge. Through modifications and enhancements to the BERTopic methodology, BERTeley provides a simple, yet powerful tool for topic modeling on a corpus consisting of scientific articles

...I'm inclined to say this isn't publishable here as the JOSS paper would count as a duplicate publication for the same work.

arfon · 2023-12-13T08:32:31Z

@editorialbot query scope

editorialbot · 2023-12-13T08:32:33Z

Submission flagged for editorial review.

arfon · 2024-01-07T20:29:53Z

@ericchagnon15 – apologies it has taken a while for us to get back to you here but after consideration by the JOSS editorial team we've determined that we believe this article would represent a second publication for the work outlined in this other article (https://www.sciencedirect.com/science/article/pii/S2949719123000419), and as such, is not permitted here.

arfon · 2024-01-07T20:29:58Z

@editorialbot reject

editorialbot · 2024-01-07T20:30:01Z

Paper rejected.

ericchagnon15 · 2024-01-07T20:31:54Z

@arfon thanks for taking the time to deliberate and for your consideration!

editorialbot added pre-review Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning labels Nov 11, 2023

editorialbot added Python TeX labels Nov 11, 2023

arfon added the paused label Nov 24, 2023

editorialbot added the query-scope Submissions of uncertain scope for JOSS label Dec 13, 2023

editorialbot added the rejected label Jan 7, 2024

editorialbot closed this as completed Jan 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PRE REVIEW]: BERTeley: A python package for topic modeling on scientific articles #6052

[PRE REVIEW]: BERTeley: A python package for topic modeling on scientific articles #6052

editorialbot commented Nov 11, 2023

editorialbot commented Nov 11, 2023

editorialbot commented Nov 11, 2023

editorialbot commented Nov 11, 2023

editorialbot commented Nov 11, 2023

arfon commented Nov 11, 2023 •

edited

Loading

editorialbot commented Nov 11, 2023

arfon commented Nov 24, 2023

ericchagnon15 commented Dec 4, 2023

arfon commented Dec 9, 2023

arfon commented Dec 11, 2023

ericchagnon15 commented Dec 12, 2023

arfon commented Dec 13, 2023

arfon commented Dec 13, 2023

editorialbot commented Dec 13, 2023

arfon commented Jan 7, 2024

arfon commented Jan 7, 2024

editorialbot commented Jan 7, 2024

ericchagnon15 commented Jan 7, 2024

[PRE REVIEW]: BERTeley: A python package for topic modeling on scientific articles #6052

[PRE REVIEW]: BERTeley: A python package for topic modeling on scientific articles #6052

Comments

editorialbot commented Nov 11, 2023

Status

editorialbot commented Nov 11, 2023

editorialbot commented Nov 11, 2023

editorialbot commented Nov 11, 2023

editorialbot commented Nov 11, 2023

arfon commented Nov 11, 2023 • edited Loading

editorialbot commented Nov 11, 2023

arfon commented Nov 24, 2023

ericchagnon15 commented Dec 4, 2023

arfon commented Dec 9, 2023

arfon commented Dec 11, 2023

ericchagnon15 commented Dec 12, 2023

arfon commented Dec 13, 2023

arfon commented Dec 13, 2023

editorialbot commented Dec 13, 2023

arfon commented Jan 7, 2024

arfon commented Jan 7, 2024

editorialbot commented Jan 7, 2024

ericchagnon15 commented Jan 7, 2024

arfon commented Nov 11, 2023 •

edited

Loading