-
Notifications
You must be signed in to change notification settings - Fork 226
Articles showing dummy date of 12/31/1969 in "Last Update" column #1832
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Downgrading to 3.9.2 seems to fix this, FWIW. |
Thank you, we thought we had made sure that no default date of 0 (milliseconds after 1.1.1970 00:00 UTC) can appear anywhere. Can you tell us the feed source as well? Then we can make sure that we refine the feature to avoid that behavior. |
I was not able to find the source of such anomalies. According to the current code, if the feed source does not contain the last update, it is set to the publication date if that exists, or to the date when the article was received. For the publication date, if the feed does not contain it, the updated date (if it exists) or the current date is used, whatever is earlier. @barijaona, @josh64x2 or @Eitot, maybe you can find a reason for the observed anomalies. I am not sure about my anomaly, it could have been caused by using a development version of Vienna. @tonycpsu did you use any beta version? |
It certainly is.
Looks like the source is correct, have to look up whether "pubDate" is taken into account though. The date is the same as in the other cases, "0" but in your time zone.
That's a good hint and in my case, I had the same gut feeling but couldn't find the code causing that. I will try another time this weekend. |
I just realized that the RefreshManager is also manipulating the article dates, which may be causing effects that I have not considered so far. |
Hi @joostdekeijzer , thank you for adding that example. I suspect the RefreshManager, which I didn't fix in the last update, is the culprit. I hope to find some time this weekend to figure it out and create a fix. |
I just subscribed to https://exiftool.org/rss.xml and I do not reproduce the issue… ![]() |
The issue occurs when an article in the feed is updated by that site. That does not happen very often (and ExifTool is not a very busy feed anyway). Some other feed I get the 1970 issue with just now (after refresh of article) is https://feeds.feedburner.com/9To5Mac-MacAllDay . |
I have done the change I was talking about a couple of weeks ago, it's just waiting to be released. I'd suggest retesting afterwards. |
Will do. Thanks, @TAKeanice! |
Sorry to have to report that with |
Same here! I upgraded to this same version this morning, but I'm still seeing some "12/31/69, 6:00 PM" dates where I was seeing them in the earlier version.
…--Brian
--
Brian A. Levine
***@***.***
https://www.brianallenlevine.name/
On Dec 11, 2024, at 9:31 AM, joost de keijzer ***@***.***> wrote:
Sorry to have to report that with Version 3.9.5 :7784c797: (8414) I'm still experiencing the issue 😢
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you commented.Message ID: ***@***.***>
|
Can you @brianallenlevine and @joostdekeijzer tell me whether you're seeing this with the old articles that you already received before the upgrade or new ones that were loaded afterwards? The last update didn't include a script for correction again, and I'm afraid some of the dates of older articles are not salvageable. There's no code any more that I know that would assign strange dates to new articles. That's why it's important for me to know whether the articles with wrong dates are old or new ones. |
@TAKeanice, I just looked again at the tidbits and documentfoundation links that I posted screenshots of earlier. I still saw the "12/31/69, 6:00 PM" dates in both feeds, in at least some of the same old articles. After this, I unsubscrived and resubscribed both feeds. In the newly resubscribed feeds, I saw fewer articles, with only valid dates. I do have additional feeds with these invalid dates. Is there perhaps a simpler way of accomplishing an unsubscribe/resubscribe without manually unsubscribing and resubscribing each feed with bad dates? I've looked, but I'm not seeing anything. Maybe export all my subscriptions, unsubscribe them all, and then re-import would do the trick? I don't mind losing unread/read status and such, if I can get rid of the bad dates. Thanks! --Brian |
@TAKeanice I'm quite sure it's still happening on new articles. But I did not remove/resubscribe a feed. See WordPress Core rss feed (Agenda for dec-11 article, published dec-10) ![]() |
I'm again seeing some "12/31/69, 6:00PM" dates for "Last Update" and "Date Published" in the TidBITS feed I unsubscribed and resubscribed earlier.
…--
Brian A. Levine
***@***.***
https://www.brianallenlevine.name/
On Dec 13, 2024, at 2:04 AM, joost de keijzer ***@***.***> wrote:
@TAKeanice I'm quite sure it's still happening on new articles. But I did not remove/resubscribe a feed.
See WordPress Core rss feed (Agenda for dec-11 article, published dec-10)
image.png (view on web)
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Thank you @brianallenlevine . Means one more round of investigation, pulling out some more gray hair and hopefully finding the place that still manipulates the dates. Can you post a screenshot? |
Here you go, @TAKeanice. Thanks so much for all your effort on this! ![]() |
It's hard to test, but I get the impression that this issue has a bigger change of happening on startup when you have "refresh on startup" enabled or when you press the "Refresh all your subscriptions" button. |
I'm still seeing the issue with Version 3.9.5 :7784c797: (8414), haven't hit 'refresh all your subscriptions', and it appears to be happening on newly retrieved articles (after 'mark all read' in a given feed). Oddly, and visible in the screenshot, one of the articles has the wrong date -- 'Trudeau gave a speech to the Liberals' holiday party — but Freeland stole the show' is showing as '2024-06-23, 08:35', but a check of the website shows 'CBC News · Posted: Dec 18, 2024 3:26 PM EST | Last Updated: 8 hours ago' -- off by nearly 6 months! |
I think I found the cause: in vienna-rss/Vienna/Sources/Database/Database.m Line 1586 in 35bc17e
|
Oh man... I would never have searched in that direction... Let's make sure existingArticle is complete then! |
Minimalist approach regarding articles' caching makes completely sense when you may have plenty of feeds to update simultaneously, and you have to handle them in memory rather than on disk. We just need to reproduce the date logic which exists in -addArticle:toFolder: to calculate the new last update date. It would make sense to keep the existing publication date in database. |
I don't think that loading an article from the DB when an update is saved to the DB anyway is too much of an effort. |
in `-updateArticle:ofFolder:withArticle:`, `existingArticle` may be an incomplete version originating from `-minimalCacheForFolder:`, and may therefore have neither `lastUpdate` nor `publicationDate` To calculate the new last update date, we reproduce the date logic which exists in -addArticle:toFolder: Fix date 1.1.1970 or 31.12.1969 appearing in "Last Update" column Issue ViennaRSS#1832
It is a waste when you may have to handle tens of thousands of articles at a time during a "Refresh All" and you have the possibility to avoid this. Look thoroughly at commit 9b4c126 |
I'd rather not alter the last update date if we just don't know the original one because we got the ground truth from some cache. It may override semantically relevant information carried by that date. Assume a feed updates that date only for content changes, not for typo corrections - we would update the date anyway. Maybe we can write a smarter update query that combines the query and the update into one, then no additional time is wasted by a second query. |
in `-updateArticle:ofFolder:withArticle:`, `existingArticle` may be an incomplete version originating from `-minimalCacheForFolder:`, and may therefore have neither `lastUpdate` nor `publicationDate` To calculate the new last update date, we reproduce the date logic which exists in -addArticle:toFolder: Fix date 1.1.1970 or 31.12.1969 appearing in "Last Update" column Issue ViennaRSS#1832
I don't really understand your concern. |
I am thinking about feeds that have a last updated date initially, but an update doesn't have it any more. From what I saw in the code, that's possible for atom feeds for example. |
As suggested by @TAKeanice's comment in issue ViennaRSS#1832 ViennaRSS#1832 (comment)
As suggested by @TAKeanice's comment in issue ViennaRSS#1832 ViennaRSS#1832 (comment)
This comment has been minimized.
This comment has been minimized.
As suggested by @TAKeanice's comment in issue ViennaRSS#1832 ViennaRSS#1832 (comment)
Another edge case : Wired's RSS feed where they change the title of an article, push it with a new publication date, while the guid reveals that it is in fact an update of a preexisting article. |
As suggested by @TAKeanice's comment in issue ViennaRSS#1832 ViennaRSS#1832 (comment) However, handle the case where the title of an article is changed, and the updated article is pushed with a new publication date, while the `guid` value reveals that it is in fact an update of a preexisting article.
in `-updateArticle:ofFolder:withArticle:`, `existingArticle` may be an incomplete version originating from `-minimalCacheForFolder:`, and may therefore have neither `lastUpdate` nor `publicationDate` To calculate the new last update date, we reproduce the date logic which exists in -addArticle:toFolder: Fix date 1.1.1970 or 31.12.1969 appearing in "Last Update" column Issue ViennaRSS#1832
As suggested by @TAKeanice's comment in issue ViennaRSS#1832 ViennaRSS#1832 (comment) However, handle the case where the title of an article is changed, and the updated article is pushed with a new publication date, while the `guid` value reveals that it is in fact an update of a preexisting article.
Fix update date / Optimize cache - Provides a much snappier user interface : prevent some temporary locks when we have simultaneously feed refresh and user selecting a feed - Fix dates of year 1969 or 1970 in "Last Udpate" column (issue #1832)
Describe the bug
Since the most recent update, some of my articles are showing dummy date of 12/31/1969 in "Last Update" column.
To Reproduce
Not sure what caused it, but it seemed to appear around when I upgraded to
Version 3.9.3 :fd467d52: (8375)
I use Inoreader as my feed backend, if that is relevant.
Screenshots

Please complete the following information:
Version 3.9.3 :fd467d52: (8375)
14.1.1 (23B81)
Additional information:
The text was updated successfully, but these errors were encountered: