-
-
Notifications
You must be signed in to change notification settings - Fork 2.8k
[WIP] Add DOI matching with not full coincidence. #6756
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Add zero weights for otes and comments.
c8bea4d
to
6c8b7d7
Compare
@@ -234,6 +234,12 @@ public String getNormalized() { | |||
return doi; | |||
} | |||
|
|||
public boolean isCompareNotExact(DOI o2) { | |||
String s1 = this.doi.replaceAll("[^\\w]", ""); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is the "replace" here necessary? Maybe it would be worthwhile to add it to the other cleanup that's already happing in the constructor?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, maybe it is. I decided to do in this way in order to divide logics of creature, comparison and inaccurate comparison of DOI. This is associated with task context. The DOI standart is not strictly defined. Some special characters can serve as indicators or as special symbols.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@KunAndrew Can you add a test for this?
We agree that the DOI standard is vague at some places. However, in the concrete case, only white spaces are replaced.
Whitespace in aribtrary locations lead to a non-found doi.
Proposed solution: Can you put that cleanup into the constructor the next two weeks? 😇
small improvements
@KunAndrew How is the status here? Is this ready for review? It woud be really cool to have this, it's a massive improvement! |
Moving cleanup's logic to DOI constructor.Add results of generate-authors.sh.
Sorry for the long wait. |
Please check the work of scripts/generate-authors.sh. I distrust the result. |
Thanks for the update. |
Moving cleanup's logic to DOI constructor.Add results of generate-authors.sh. (revert AUTHORS)
I reverted the AUTHORS. I upgraded project from github. Then I merged in master (on my local machine) and pushed to remoted. But I have problem with resolving problems. "These conflicts to complex to resolve in the web editor." How should I do it? Should I create a new Pull Request (from my master-branch) and close the current one? |
@KunAndrew The best thing is to do this on the command line. You need to merge in the upstream the master branch from the jabref repository..
If you happen to have still some problems or don't feel confident enough I can take over and do the conflict resolving |
Yes, that would be great. I don't feel confident enough with git GUI. |
Add DOI matching with not full coincidence.
"Improvement of Duplicate Detection #6707"
Fixes #6707