> :iphone:**Try the new [difPy Web App](https://difpy.app/)**!
-> :point_right::new: With **difPy v3.x** you can count on significant **performance increases**, **new features** and **bug fixes**. Check out the [release notes](https://github.com/elisemercury/Duplicate-Image-Finder/releases/) for details.
+> :point_right::new: **difPy v4-beta** is out! difPy v4 is up to **10x as fast** as previous difPy versions. Check out the [release notes](https://github.com/elisemercury/Duplicate-Image-Finder/releases/) for details.
> :open_hands: Our motto? We :heart: Open Source! **Contributions and new ideas for difPy are always welcome** - check our [Contributor Guidelines](https://github.com/elisemercury/Duplicate-Image-Finder/wiki/Contributing-to-difPy) for more information.
@@ -31,32 +33,40 @@ Check out the [difPy package on PyPI.org](https://pypi.org/project/difPy/)
## Description
difPy searches for images in **one or more different folders**, compares the images it found and checks whether these are duplicates. It then outputs the **image files classified as duplicates** as well as the **images having the lowest resolutions**, so you know which of the duplicate images are safe to be deleted. You can then either delete them manually, or let difPy delete them for you.
difPy does not compare images based on their hashes. It compares them based on their tensors, i.e. the image content. This allows difPy to **not only search for duplicate images, but also for similar images**.
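
To illustrate why comparing pixel content can surface similar images where a file hash cannot, here is a minimal sketch in pure Python. It assumes the images are already decoded into equally sized grayscale grids and uses mean squared error (MSE) as the similarity measure. The function names and the threshold are hypothetical and are not taken from difPy's code.

```python
import hashlib


def mse(a, b):
    """Mean squared error between two equally sized grayscale pixel grids."""
    if not a or not a[0]:
        raise ValueError("images must not be empty")
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same dimensions")
    total = sum(
        (pa - pb) ** 2
        for ra, rb in zip(a, b)
        for pa, pb in zip(ra, rb)
    )
    return total / (len(a) * len(a[0]))


def content_hash(image):
    """SHA-256 over the raw pixel values (each pixel must be in 0..255)."""
    return hashlib.sha256(bytes(p for row in image for p in row)).hexdigest()


def is_similar(a, b, threshold=5.0):
    """Hypothetical rule: images match when their MSE is at most `threshold`."""
    return mse(a, b) <= threshold


original = [[10, 20], [30, 40]]
brighter = [[11, 21], [31, 41]]   # every pixel shifted by 1
different = [[200, 0], [0, 200]]

# A one-level brightness shift changes the hash but barely changes the MSE.
print(content_hash(original) == content_hash(brighter))
print(mse(original, brighter), is_similar(original, brighter))
print(mse(original, different), is_similar(original, different))
```

A threshold of zero would restrict the match to pixel-identical images, while a larger threshold admits progressively less similar ones.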
+:notebook: For a **detailed usage guide**, please view the official **[difPy Usage Documentation](https://difpy.readthedocs.io/)**.
Folder paths can be specified as standalone Python strings, or within a list.
+Folder paths can be specified as standalone Python strings, or within a list. `difPy.build()` first builds a collection of images by scanning the provided folders and generating image tensors. `difPy.search()` then starts the search for duplicate images.
:notebook: For a **detailed usage guide**, please view the official **[difPy Usage Documentation](https://difpy.readthedocs.io/)**.
## Output
difPy returns various types of output that you may use depending on your use case:
+### I. Result Dictionary
A **JSON formatted collection** of duplicates/similar images (i.e. **match groups**) that were found, where each key is a **randomly generated unique ID** for an image file:
```python
@@ -72,7 +82,7 @@ search.result
...
}
```
### II. Lower Quality List
A **list** of duplicates/similar images that have the **lowest quality** among match groups:
```python
@@ -82,37 +92,57 @@ search.lower_quality
["C:/Path/to/Image/duplicate_image1.jpg",
"C:/Path/to/Image/duplicate_image2.jpg", ...]
```
-A **JSON formatted collection** with statistics on the completed difPy process:
+Lower quality images can then be moved to a different location:
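
As an illustration only, the following standard-library sketch moves a list of paths shaped like `search.lower_quality` into a destination folder. The helper `move_files` and its skip-on-conflict policy are hypothetical and are not part of difPy's API.

```python
import shutil
from pathlib import Path


def move_files(paths, destination):
    """Move each existing file in `paths` into `destination`.

    Returns the new locations of the files that were actually moved.
    """
    dest = Path(destination)
    dest.mkdir(parents=True, exist_ok=True)
    moved = []
    for source in map(Path, paths):
        if not source.is_file():
            continue  # skip entries that no longer exist on disk
        target = dest / source.name
        if target.exists():
            continue  # assumed policy: never overwrite an existing file
        shutil.move(str(source), str(target))
        moved.append(str(target))
    return moved
```

Skipping on a name conflict keeps the sketch non-destructive; a real workflow might rename the incoming file instead.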
> :point_right: Windows users can add difPy to their [PATH system variables](https://www.computerhope.com/issues/ch000549.htm) by pointing it to their difPy package installation folder containing the [`difPy.bat`](https://github.com/elisemercury/Duplicate-Image-Finder/difPy/difPy.bat) file. This adds `difPy` as a command in the CLI and will allow direct invocation of `difPy` from anywhere on the device.
When no directory parameter is given in the CLI, difPy will **run on the current working directory**.
-When running from the CLI, the output of difPy is written to files and saved in the working directory by default. To change the default output directory, specify the `-Z / -output_directory` parameter. The "xxx" in the output filenames is a unique timestamp:
+When running from the CLI, the output of difPy is written to files and **saved in the working directory** by default. To change the default output directory, specify the `-Z / -output_directory` parameter. The "xxx" in the output filenames is the current timestamp:
```python
-difPy_results_xxx.json
-difPy_lower_quality_xxx.csv
-difPy_stats_xxx.json
+difPy_xxx_results.json
+difPy_xxx_lower_quality.csv
+difPy_xxx_stats.json
```
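
Because each run writes a new set of timestamped files, a script that consumes the CLI output has to locate the most recent one. The sketch below assumes the newer `difPy_xxx_results.json` naming shown above and picks the file by modification time, since the exact timestamp format is not specified here. The helper `latest_results` is hypothetical.

```python
import json
from pathlib import Path


def latest_results(output_dir="."):
    """Load the most recently modified difPy results file, or return None."""
    candidates = sorted(
        Path(output_dir).glob("difPy_*_results.json"),
        key=lambda path: path.stat().st_mtime,
    )
    if not candidates:
        return None  # no results file found in this directory
    with candidates[-1].open(encoding="utf-8") as handle:
        return json.load(handle)
```

Calling `latest_results()` with no argument looks in the current working directory, which matches the default output location described above.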
:notebook: For a **detailed usage guide**, please view the official **[difPy Usage Documentation](https://difpy.readthedocs.io/)**.
## difPy Web App
-difPy can also be accessed via its new **web interface**. With difPy Web, you can compare **up to 200 images** and download a **deduplicated ZIP file** - all powered by difPy. [Read more](https://github.com/elisemercury/difPy-app).
+difPy can also be accessed via a browser. With difPy Web, you can compare **up to 200 images** and download a **deduplicated ZIP file** - all powered by difPy. [Read more](https://github.com/elisemercury/difPy-app).
:iphone:**Try the new [difPy Web App](https://difpy.app/)**!
@@ -166,5 +202,5 @@ difPy can also be accessed via its new **web interface**. With difPy Web, you ca