## About

mucc is a tool for processing data recovered by scalpel.
Its features include:

1. Splitting PDF files into sub-files.
2. Deleting duplicate files (this feature can be used independently).

<br>
<br>

## How it Works

#### Retrieving Sub-Files

scalpel parses disk images for `%PDF` headers and `%EOF` footers.
If scalpel's maximum file size is set high, the generated files will often consist of several concatenated sub-files.
mucc finds the nested `%PDF` and `%EOF` tags and returns the individual files with byte-level precision.
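
The splitting step described above can be illustrated with a minimal sketch. It is not mucc's actual code: the class `SubFileSplitter` and its methods are hypothetical names, and the sketch assumes that each sub-file runs from a `%PDF` marker to the first `%EOF` marker that follows it.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper for illustration; not part of mucc's source. */
final class SubFileSplitter {
    private static final byte[] HEADER = "%PDF".getBytes(StandardCharsets.US_ASCII);
    private static final byte[] FOOTER = "%EOF".getBytes(StandardCharsets.US_ASCII);

    /** Returns the index of the first occurrence of pattern at or after from, or -1. */
    static int indexOf(byte[] data, byte[] pattern, int from) {
        for (int i = Math.max(from, 0); i <= data.length - pattern.length; i++) {
            int j = 0;
            while (j < pattern.length && data[i + j] == pattern[j]) {
                j++;
            }
            if (j == pattern.length) {
                return i;
            }
        }
        return -1;
    }

    /** Splits a carved blob into one byte array per header/footer pair. */
    static List<byte[]> split(byte[] data) {
        List<byte[]> parts = new ArrayList<>();
        int pos = 0;
        while (true) {
            int start = indexOf(data, HEADER, pos);
            if (start < 0) {
                break; // no further header
            }
            int footer = indexOf(data, FOOTER, start + HEADER.length);
            if (footer < 0) {
                break; // header without footer: truncated sub-file, skipped here
            }
            int end = footer + FOOTER.length; // exclusive end, right after the footer bytes
            parts.add(Arrays.copyOfRange(data, start, end));
            pos = end;
        }
        return parts;
    }

    public static void main(String[] args) {
        byte[] blob = "junk%PDF-1.4 A %%EOF\n%PDF-1.7 B %%EOF tail"
                .getBytes(StandardCharsets.US_ASCII);
        for (byte[] part : split(blob)) {
            System.out.println(new String(part, StandardCharsets.US_ASCII));
        }
    }
}
```

Bytes before the first header and after the last footer are dropped. A PDF saved with incremental updates contains more than one `%%EOF` marker, so cutting at the first footer would truncate such a file; a more careful splitter would have to account for that.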

#### Deleting Duplicates

mucc calculates the MD5 hash of each file and deletes the duplicates, i.e. files with identical hashes.
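
As a rough illustration of hash-based deduplication, the sketch below hashes every regular file in a directory and deletes any file whose MD5 hash has already been seen. It is an assumption-laden example, not mucc's implementation: the class `DuplicateRemover` and its methods are hypothetical, and it uses a `HashSet` to detect repeats, whereas the `QuicksortMd5` class suggests mucc itself sorts the hashes.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical helper for illustration; not part of mucc's source. */
final class DuplicateRemover {

    /** Returns the MD5 hash of a file as a lowercase hex string. */
    static String md5Hex(Path file) throws IOException, NoSuchAlgorithmException {
        MessageDigest md = MessageDigest.getInstance("MD5");
        // Reads the whole file into memory; fine for a sketch, costly for large files.
        byte[] digest = md.digest(Files.readAllBytes(file));
        StringBuilder hex = new StringBuilder();
        for (byte b : digest) {
            hex.append(String.format("%02x", b & 0xff));
        }
        return hex.toString();
    }

    /** Deletes every file whose hash was already seen; returns the deleted paths. */
    static List<Path> deleteDuplicates(Path dir) throws IOException, NoSuchAlgorithmException {
        List<Path> files;
        try (Stream<Path> entries = Files.list(dir)) {
            // Sorted so that the copy that is kept is deterministic.
            files = entries.filter(Files::isRegularFile).sorted().collect(Collectors.toList());
        }
        Set<String> seen = new HashSet<>();
        List<Path> deleted = new ArrayList<>();
        for (Path file : files) {
            if (!seen.add(md5Hex(file))) { // add() returns false if the hash is already present
                Files.delete(file);
                deleted.add(file);
            }
        }
        return deleted;
    }
}
```

The first file of each group (in sorted path order) is kept and later copies are removed. Because MD5 is not collision resistant, a cautious variant would compare file contents byte by byte before deleting.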

<br>
<br>

## Contents of `src/app`

| Class | Description |
|---------------|-------------|
| Artifacts | Simple objects used by other classes. |
| Controller | JavaFX class containing application logic. |
| Execute | Issues shell commands. |
| layout.fxml | Contains layout data. |
| Main | Main JavaFX class. Run from here. |
| QuicksortMd5 | Quicksort algorithm. |
| routines | Contains higher level routines called by Controller. |
| Tools | Simple tools used by other classes. |
| Write | Writes to /tmp. Used for data storage. |

<br>
<br>

## Issues and Features

- Fix the issue where nested duplicates are not deleted on the first pass.
- Clean up the code.
- Add scalpel integration.
- Replace "__" with progress indicators for states.

<br>
<br>

## Screenshot