Ha! I wrote it! Well the original anyway. It’s been forked a few times since I stepped away.
So yeah, I think it’s pretty cool 😆
Legend!
Do you use NGX yourself?
Actually, I stepped away from the project 'cause I stopped using it altogether. I started the project to satisfy the British government with their ridiculous requirements for proof of my relationship with my wife so I could live here. Once I was settled though and didn’t need to be able to bring up flight itineraries from 5 years ago, it stopped being something I needed.
Well that, and lemme tell you, maintaining a popular Free software project is HARD. Everyone has an idea of where stuff should go, but most of the contributions come in piecemeal, so you’re left mostly acting as the one trying to wrangle different styles and architectures into something cohesive… while you’re also holding down a day job. It was stressful to say the least, and with a kid on the way, something had to give.
But every once in a while I consider installing paperless-ngx just to see how it’s come along, and how much has changed. I’m absolutely delighted that it’s been running and growing in my absence, and from the screenshots alone, I see that a lot of the ideas people had when I was helming made it in in the end.
Thank you very much for the generously contributed code and time while working on it.The effort you put in, will live on for many years to come.
Aww! Thank you! It was fun ❤️
Hey man, that is what I used it for, but with the Belgian government! Great piece of software though!
Oh wow! Quite a journey!
I’d consider Paperless a hall-of-famer for self-hosted software and something most people who get into self-hosting discover at some point, even if they don’t use it.
So thanks for building it, even if you’ve moved on. You gave the forkers something great to build from.
It is honestly an awesone tool.
Just want to say thank you! Paperless is one of the first things I recommend to anyone considering self hosting their infra. Amazing piece of work!
Thanks! The crazy thing is that it’s really not that complicated. I’d say the hardest work was in writing the docs :-). It’s awesome to hear that people still use it and love it though.
Super good, it is increíble useful and the ability to find any document in almost any place in seconds in awesome.
Once this is said, you need to stick to a process and it is time consuming, and of course, you need to manually review the automatics tagging feature.
So, It is not a set and forget like most of the people expect
Slow and unreliable with sqlite, but rock solid and amazing with postgres.
Today, every document I receive goes into my duplex ADF scanner to scan to a network share which is monitored by Paperless. Documents there are ingested and pre-tagged, waiting for me to review them in the inbox. Unlike other posters here, I find the tagging process extremely fast and easy. Granted, I didn’t have to bring in thousands of documents to begin with but started from a clean slate.
What’s more, development is incredibly fast-moving and really useful features are added all the time.
Made my life a lot easier. No more looking for documents, all is in one place, fulltext search… Don’t ever want to go back
But most important: always have a backup ☝🏻
I haven’t really configured a tagging system that makes any sense so it’s mostly used the search through documents through text. I’d like to figure out how to hook up a vector database to it to do really fuzzy searching
What scanners do people recommend?
If you have an Android phone I can’t recommend Genius Scan enough. Fast, accurate, lots of features. I use it with syncthing by exporting the files to a folder that’s configured to sync the paperless input folder.
Tried to use it, but I don’t want to move all of my data from my currently laid out folder/file structure into a docker container that I then need to backup/upgrade/feed/water/etc., especially when my grasp on docker containers is limited (at best) and I’m dealing with “production” data.
I wish the software worked like Immach; I could point it at a root folder and it would index everything with read only rights.
That, and I’m slightly worried that this iteration will stop being supported and it gets forked (again) which is great that it can be forked but I have no idea what would go into migrating data (see my limited docker knowledge from the first sentence).
Well you point the docker to some external data. You do never store the documents inside the docker. (Because it would get lost when it is updated)
It is comparable to the way Immich works.
Maybe I’m misunderstanding this, but their FAQ specifically says:
By default, your documents are stored inside the docker volume paperless_media. Docker manages this volume automatically for you.
It also says that documents are removed from the consumption directory, renamed, and put into a folder that you shouldn’t modify.
And that’s my problem with the project. I want to be able to keep my file name and organizational structure.
Have a look here: [https://github.com/paperless-ngx/paperless-ngx/blob/main/docker/compose/docker-compose.postgres.yml](paperless-ngx docker-compose.yml)
down under
webserver:
you changedata:/usr/src/paperless/data
to/path/to/where/you/wantorhave/your/files:/usr/src/paperless/data
. Same for the media path and you’re done. paperless now uses a folder on your machine instead of a volume. If you want to be clean you will then also remove the volume declaration at the bottom of the file.i think OP wants it to leave their current files alone. But Paperless doesn’t work like that, it deletes the originals and arranges the files its own way.
I do want to leave my current files alone.