Branch: refs/heads/develop
Home: https://github.com/dreamwidth/dw-free
Commit: 8fa35004f538ad845a869abd17d13630e9e3f0b5
https://github.com/dreamwidth/dw-free/commit/8fa35004f538ad845a869abd17d13630e9e3f0b5
Author: Afuna <fu@dreamwidth.org>
Date: 2013-07-03 (Wed, 03 Jul 2013)
Changed paths:
M cgi-bin/DW/Worker/ContentImporter/Local/Entries.pm
Log Message:
-----------
(Bug 5080) Grab all subjects and logtimes from a journal
Commit: d1a25f7851fe980d5c75b6350d28a8f026b87904
https://github.com/dreamwidth/dw-free/commit/d1a25f7851fe980d5c75b6350d28a8f026b87904
Author: Afuna <fu@dreamwidth.org>
Date: 2013-07-11 (Thu, 11 Jul 2013)
Changed paths:
M cgi-bin/DW/Worker/ContentImporter/LiveJournal/Entries.pm
Log Message:
-----------
(Bug 5080) Separate out pruning logic into a method, and add duplicate detection
This is a best guess based on exact server timestamp and subject. If
both match exactly, we figure that this was a previously imported entry,
or maybe crossposted entry, that is not linked properly (as an import /
crosspost) for some reason.
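
The heuristic described above can be illustrated with a minimal sketch. This is not the code from LiveJournal/Entries.pm (which is Perl and is not shown in this log); the names `Entry`, `build_index`, and `looks_like_duplicate` are hypothetical, and the sketch assumes timestamps are compared as exact strings.

```python
from typing import Iterable, NamedTuple, Set, Tuple


class Entry(NamedTuple):
    """Hypothetical stand-in for the two fields the heuristic compares."""
    logtime: str  # server timestamp, compared as an exact string
    subject: str


def build_index(local_entries: Iterable[Entry]) -> Set[Tuple[str, str]]:
    # Collect every (logtime, subject) pair already present in the journal.
    return {(e.logtime, e.subject) for e in local_entries}


def looks_like_duplicate(remote: Entry, index: Set[Tuple[str, str]]) -> bool:
    # Best guess only: both the timestamp and the subject must match exactly.
    return (remote.logtime, remote.subject) in index


# Example data, made up for illustration.
local = [
    Entry("2013-07-01 10:00:00", "Hello"),
    Entry("2013-07-02 09:30:00", ""),
]
index = build_index(local)
```

Because the match is exact, an entry whose subject was edited or whose timestamp differs by a second would not be flagged, which is consistent with the "best guess" wording in the log message.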
Commit: cc60e576bca4f3798266cae5d61c0c37f7c9d8bb
https://github.com/dreamwidth/dw-free/commit/cc60e576bca4f3798266cae5d61c0c37f7c9d8bb
Author: Afuna <fu@dreamwidth.org>
Date: 2013-07-17 (Wed, 17 Jul 2013)
Changed paths:
M cgi-bin/DW/Worker/ContentImporter/LiveJournal/Entries.pm
M cgi-bin/DW/Worker/ContentImporter/Local/Entries.pm
Log Message:
-----------
(Bug 5080) Log when we skip an entry because it looks like a dupe
So we know what's going on. We had to move this below the existing
checks, so that we can log only if it's a guess (not if we're certain
because it had been previously imported/crossposted).
Also adds error-checking for db issues.
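
The ordering this message describes (certain checks first, then the logged guess, with database failures surfaced) might look roughly like the sketch below. It is an illustration under assumptions, not the importer's actual Perl: `should_skip`, `linked_ids`, `load_index`, and the logger name are all hypothetical.

```python
import logging
from typing import Callable, Set, Tuple

log = logging.getLogger("importer_sketch")  # hypothetical logger name


def should_skip(
    entry_id: int,
    logtime: str,
    subject: str,
    linked_ids: Set[int],
    load_index: Callable[[], Set[Tuple[str, str]]],
) -> bool:
    # Certain case: the entry is already linked as an import or crosspost.
    # Skip without logging, since nothing is being guessed here.
    if entry_id in linked_ids:
        return True

    # Loading the local (logtime, subject) pairs may fail; report the
    # failure instead of silently treating the journal as empty.
    try:
        index = load_index()
    except Exception as err:
        raise RuntimeError(f"could not load local entries: {err}") from err

    # Guess case: exact timestamp and subject match. Log it so that the
    # skipped entry can be traced later.
    if (logtime, subject) in index:
        log.info("Skipping entry %s: looks like a duplicate", entry_id)
        return True

    return False
```

Running the certain checks first means the log only ever records heuristic skips, which keeps it useful for spotting false positives.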
Commit: 747df055f90541d0415af5ca074d1d89a1d9f359
https://github.com/dreamwidth/dw-free/commit/747df055f90541d0415af5ca074d1d89a1d9f359
Author: Afuna <afunamatata+github@gmail.com>
Date: 2013-07-17 (Wed, 17 Jul 2013)
Changed paths:
M cgi-bin/DW/Worker/ContentImporter/LiveJournal/Entries.pm
M cgi-bin/DW/Worker/ContentImporter/Local/Entries.pm
Log Message:
-----------
Merge pull request #498 from afuna/importer-heuristic-bug5080
(Bug 5080) Detect duplicates based on timestamp / subject
Compare: https://github.com/dreamwidth/dw-free/compare/a19c4996915d...747df055f905