You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

52 lines
1.4 KiB

### About
mucc is a tool for processing data recovered by scalpel.
It's features include:
1. Splitting PDF files into sub-files.
2. Deleting duplicate files, which can be used independently.
<br>
<br>
### How it Works
##### Retrieving Sub-Files
scalpel parses a disk image for %PDF headers
and %EOF footers.
If max_filsize is set high, the generated files will often consist of several
concatenated sub-files.
Here mucc finds the nested %PDF and %EOF tags
and returns the files with byte sized precision.
##### Deleting Duplicates
Here mucc calculates the md5 hash of each file and deletes the identical files.
<br>
<br>
### Contents of `src/app`
| Class | Description |
|---------------|-------------|
| Artifacts | Simple objects used by other classes.|
| Controller | JavaFX class containing application logic. |
| Execute | Issues shell commands.|
| layout.fxml | Contains layout data.|
| Main | Main JavaFX class. Run from here.|
| QuicksortMd5 | Quicksort algorithm.|
| routines | Contains higher level routines called by Controller.|
| Tools | Simple tools used by other classes.|
| Write | Writes to /tmp. Used for data storage.|
### Issues and Features
- Fix issues where nested duplicates would not be deleted on first pass.
- Make code prettier.
- Add scalpel integration.
- Replace "__" with progress indicators for states.
### Screenshot
![UI](src/app/img/screen.png)