- cross-posted to:
- piracy@lemmy.ml
- datahoarder@lemmy.ml
- cross-posted to:
- piracy@lemmy.ml
- datahoarder@lemmy.ml
This is by far the largest music metadata database that is publicly available. For comparison, we have 256 million tracks, while others have 50-150 million. Our data is well-annotated: MusicBrainz has 5 million unique ISRCs, while our database has 186 million.
Does this mean the MusicBrainz database will soon go from 5 million to 186 million tracks?
Asking the real questions here…
Probably not worth it to store the AI tracks
If I ran mb, I would be cautious importing the data directly. I’m sure Spotify would consider it trade information and go after anyone directly using it. However if a few million people added the tracks with individual edits then it probably won’t take too long.
I’ll strongly suggest to take out all the cheaply AI generated music from this “back up” and save themselves some space.
I’m not sure how they would go about doing that at scale without also getting some false positives and removing human music too
The data they compiled is really cool.
If reading the chart right, the genera with the most artists is opera.
Even if they didn’t have the music files, the analysis on the metadata is insane.
Publicly admitting they are the origin of the torrents is definitely
a riskyan insane move. I don’t think they want Sony going after them, but also fuck Sony for locking art behind shitty contracts that forces these kind of projects to exist.Publicly admitting they are the origin of the torrents is definitely a risky an insane move. I don’t think they want Sony going after them
Let’s be honest: Everybody is trying to go after Annas Archive. Every book publisher wants to get them, the US government, too and it really doesn’t matter if every music publisher wants them also. I hope that they are based in a country where the western systems can’t get them
I hope (also assume since it hasn’t been taken down yet) it’s more of a decentralised deal with servers in many places and backups in every nation under the sun
The 3 major labels are equally predatory not only Sony
There’s definitely gonna be some crazy guy who will put this on their server and stream it to their phones lol
I stream mine through Plexamp. Up to almost 400k tracks.
Oh im thinking of it lol
My first though as well
- Over-focus on the most popular artists. There is a long tail of music which only gets preserved when a single person cares enough to share it. And such files are often poorly seeded.
- We primarily used Spotify’s “popularity” metric to prioritize tracks. View the top 10,000 most popular songs in this HTML file (13.8MB gzipped).
- For popularity>0, we got close to all tracks on the platform. The quality is the original OGG Vorbis at 160kbit/s. Metadata was added without reencoding the audio (and an archive of diff files is available to reconstruct the original files from Spotify, as well as a metadata file with original hashes and checksums).
- For popularity=0, we got files representing about half the number of listens (either original or a copy with the same ISRC). The audio is reencoded to OGG Opus at 75kbit/s — sounding the same to most people, but noticeable to an expert.
Perhaps I’m reading this wrong, but is this not a little backwards? Since unpopular music is poorly preserved, shouldn’t the focus be on getting the least popular music first?
It depends on what your goal is: If you want to preserve the music that is important to most people or to the era, you should start with the most popular stuff. And Spotify has a big spam problem. Everybody who thinks he is a DJ wants his music to be on there and there is so much AI music flooding the scene. So it does make sense to backup what people are actually listening and not some AI-generated music spam nobody cares about.
I mean, they say earlier that music is actually well-preserved, but it’s disproportionately popular music. If the goal is then to preserve everything, I’d expect them to go for stuff that isn’t likely to be in some random audiophile’s collection or whatever then.
I am pretty sure the major labels are already preserving the most mainstream artists. Msybe it should be sorting by the most popular independent artists
The politics of preservation is definitely an interesting one. I suppose one argument in favor of preserving more popular music is that there are going to be fewer popular tracks than unpopular tracks - and they’re already at 300TB, which is nothing to sneeze at, especially since it’s a third the size of their existing library of ebooks.
I agree. I seed torrents/files that took me a long time to finish.
This is the one thing on Spotify I can’t get elsewhere. Would be nice to have a non transcode copy.
https://open.spotify.com/album/4emoC6C9fCDkWPdTuxN9an
…Like Cologne (Spotify Exclusive)
Queens of the Stone Age
2013 • 3 songs • 14 min 5 secWell, since this archive says it contains the original ogg @160kbps for all artists with a popularity >0, it’ll be in this collection. Your wait may be over soon.
sweet
deleted by creator
Spotify is why I set up a Funkwhale server
Is funkwhale also a sort of soulseek?
Oh no, around here we mention esoteric software but we will never include any extra information in the post. If you know you know.
No need to play secret hacker around here lol 😊
Soulseek afict requires dedicated clients. The Subsonic standard is supported by more & more mobile/PC apps, I wish it was supported
I guess I gotta donate more to anna
Oo, I’ll have to check those when they release. I follow some artists that only upload to YouTube and Spotify, neither of which is ideal.
So hear me out. Streamio can stream video from various sources including torrents. So it should be possible to create some music frontend that can access the music library similarly. Right? There are probably someone who is creating this right now.









