Bug 85959 - do not count 'the' in artist name in music database
Summary: do not count 'the' in artist name in music database
Status: RESOLVED FIXED
Alias: None
Product: amarok
Classification: Applications
Component: general (show other bugs)
Version: unspecified
Platform: Debian testing Linux
: NOR wishlist
Target Milestone: ---
Assignee: Seb Ruiz
URL:
Keywords:
: 87095 95313 96135 105647 (view as bug list)
Depends on:
Blocks:
 
Reported: 2004-07-25 20:23 UTC by Simon
Modified: 2006-09-12 14:18 UTC (History)
5 users (show)

See Also:
Latest Commit:
Version Fixed In:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Simon 2004-07-25 20:23:39 UTC
Version:            (using KDE KDE 3.2.2)
Installed from:    Debian testing/unstable Packages
OS:                Linux

Suppose I have two albums from the Beatles. One has 'Beatles' as the artist in the id3-tag, the other has 'The Beatles'. It now shows up in two different places in the music library. Maybe it should be better to clip the 'the', at least internally in the database for more consistency and to have it show up in one place in the library.
Comment 1 Christian Muehlhaeuser 2004-08-25 01:53:59 UTC
*** Bug 87095 has been marked as a duplicate of this bug. ***
Comment 2 skrald 2004-12-18 13:29:54 UTC
Warning: The band "The The" shouldn't be interpretted as the empty string... :-)
Comment 3 rred 2004-12-18 16:40:34 UTC
I really like how iTunes does this, because it doesn't actually cut out the "The", but it sorts the band as if the "The" had been cut.
Comment 4 sam tygier 2004-12-18 20:59:13 UTC
not sure how amarok works internally, but i guess there must be a database held in ram somewhere with all the loaded songs, and various bit of info about them. i imagine there is a name field. if an sort_name field could be added with 'a', 'the', and maybe 'der', 'die', 'das', 'le, 'la' truncated. when you sort by name the sort_name could be used instead of the actual name, but the actual name would be displayed.

i suppose the other way would be to fiddle with the sort algorithm, but this might slow down to sort function a lot.

note: itunes does not aply this sorting to album names, though prehaps it should.

note 2: the mac os x finder (file browser) does smart sorting on numbers. ie '9' before '10' in file names.
Comment 5 Christian Muehlhaeuser 2005-01-04 08:40:26 UTC
*** Bug 95313 has been marked as a duplicate of this bug. ***
Comment 6 Christian Muehlhaeuser 2005-01-04 08:47:31 UTC
*** Bug 96135 has been marked as a duplicate of this bug. ***
Comment 7 Laszlo Pandy 2005-02-24 16:17:39 UTC
*** This bug has been confirmed by popular vote. ***
Comment 8 Gleb Litvjak 2005-03-15 14:06:36 UTC
Could there be a functionality similar to that audioscrobbler has - the aliases? I mean, a user tells amarok to treat "Beatles" and "The Beatles" as the same band.
Comment 9 Thiago Macieira 2005-05-14 16:05:06 UTC
*** Bug 105647 has been marked as a duplicate of this bug. ***
Comment 10 Fred Mayhew 2005-05-25 16:31:45 UTC
Be sure that when removing the 'the' to not indiscriminately remove any 'the'. For instance, you wouldn't want to remove the 'the' from Tom Petty and the Heartbreakers. Be sure you make it functional so that it only removes it if is the first word.
Comment 11 Reigo Reinmets 2005-05-25 17:47:25 UTC
we CANNOT remove The from artist names. For example, "The Prodigy"(techno) and "Prodigy"(rap) are NOT the same artists!
Comment 12 Jim Higson 2005-05-25 17:49:53 UTC
Even if "The Prodigy" is displayed, both should be sorted under P, perhaps?
Comment 13 Laszlo Pandy 2005-05-26 06:47:33 UTC
Look at how iTunes does it of you can. If you have two artists; "Prodigy" and "The Prodigy", they would be listed in the collection browser right next to each other, because amarok would sort them as two copies of the same name, and display them as two different names. 
Comment 14 Josh 2005-06-17 14:59:21 UTC
RE:
"we CANNOT remove The from artist names. For example, "The Prodigy"(techno) and "Prodigy"(rap) are NOT the same artists! "

I don't know about you guys....but this would NOT affect me at all and if it did it would probably be one artist and worth the bother instead of always having to scroll through the massive amount of 'the' artists...besides if its really that much trouble you can have a feature that you can turn on or off that appends the 'the' to the end...so The Beatles becomes Beatles, The. it'll look like hell but that way its easy to differentiate if you have a lot of artists w/ the same name (which no one does...) but that's why it'll be an option that can be turned off
Comment 15 Matt Howe 2005-06-17 15:37:41 UTC
RE: "we CANNOT remove The from artist names. For example, "The Prodigy"(techno) and "Prodigy"(rap) are NOT the same artists!"

Why does that even matter if it only affects the sort order? THE Prodigy would still be there with the same name, the only difference is that it would be with the Ps instead of the Ts.

This sounds like an excellent idea to me, I usually rename "The" bands to "Band, The", this would save me the trouble.
Comment 16 Seb Ruiz 2005-06-27 14:22:33 UTC
I'll fix this by 1.3 final
Comment 17 Seb Ruiz 2005-06-30 07:47:01 UTC
SVN commit 430109 by seb:

* Ignore artists and albums beginning with 'the ' in the collection browser and sort accordingly.
* Allow ascending/descending sorting in the collection browser
* Huge amounts of style fixes, making code easier to read

BUG: 85959


 M  +122 -32   collectionbrowser.cpp  
 M  +4 -2      collectionbrowser.h  
Comment 18 Anthony Ivanoff 2005-06-30 12:54:55 UTC
Thank you very much!

And is it possible also to make the same to remove all these  "feat." too?
Comment 19 Seb Ruiz 2005-06-30 13:39:04 UTC
isnt a file usually like this:

Santana - Smooth (feat. Rob Thomas)

How would removing feat. accomplish anything?
Comment 20 Laszlo Pandy 2005-07-01 17:57:33 UTC
Just FYI, adding this feature has broken dragging and dropping for the artists which are altered by this feature. I've mage a bug report: http://bugs.kde.org/show_bug.cgi?id=108408
Comment 21 Anthony Ivanoff 2005-07-01 18:51:11 UTC
Seb Ruiz,

I mean, in the artist name only.

Like "DJ Sash feat. BoneyM - Ma Baker"
Comment 22 Gleb Litvjak 2005-07-01 22:39:24 UTC
@You Don't Know Me:

Well, usually it is like this:

Artist: DJ Sash
Track:  Ma Baker (feat. BoneyM)

(see http://www.audioscrobbler.com/forum/13179/_/28790 for a short discussion)
Comment 23 plax 2005-07-27 14:56:34 UTC
The problem doesn't only affect the sorting of the artist's name, but also amaroK's whole handling of contextual information (smart playlists, ...) (for example, if I listen to a song by The Beatles, amaroK doesn't list songs by Beatles in the categories "Favourite songs by this artist").
On the other hand, with a collection of 1000+ music files there are often several ways of writing an artists name. Therefore I would suggest a more tolerant similiarity search over artist names. amaroK could then keep a list of artist names the user has entered (or maybe share this list over the internet) and use a name from that list if a found name is very similar.
From my experience, the following rules would be helpful:
* Ignore uppercase/lowercase (eg Rem = REM)
* Ignore non-alphanumerics (eg. AC-DC = AC/DC = ACDC, R.E.M = REM)
* Ignore The, Der, Die, Das, Los, Las, El, La, Le, Les, ... at least as a first word (this list should hopefully be enough for English, German, Spanish and French artists. (Idea taken from Sam Tygier))

@Reigo Reinmets: Personally, I could live with "Prodigy" and "The prodigy" being recognized as only one artist. Having music from one artist that is stored under serveral names is more annoying and frequent than having two artists with such a similar name. BTW, having such a list of artists that could contain both Prodigys could also solve that conflict.
Comment 24 Jim Higson 2005-07-27 15:36:03 UTC
> Ignore The, Der, Die, Das, Los, Las, El, La, Le, Les, ...

Also: MC, DJ, "The Artist Formerly Known As"

Two of which are serious.
Comment 25 Michael Rodriguez-Torrent 2005-08-02 18:00:29 UTC
Implementing the same ignoring in playlist sorting would be nice. :)
Comment 26 Tobias Niwi 2005-08-18 09:09:39 UTC
The problem is only solved when the "Group By" scheme starts with "artist". All other schemes (like the popular "Genre / Artist / Album") are not supported yet.
Comment 27 Bonnaud Frédéric 2006-04-25 18:46:44 UTC
i totaly agree with comment #23 and better, the list of first words that should be ignore on artist's order must be editable. It will be a must if we can also choose to ignore some words for the whole artist's name
Comment 28 Tilman Vogel 2006-09-12 14:07:02 UTC
What are the plans for supporting prefixes other then "the", e.g. German ones? I very much favor a user-specifiable list of prefixes with a sensible default. Shouldn't this feature request be reopened until this is implemented?
Comment 29 Seb Ruiz 2006-09-12 14:18:04 UTC
see bug 121931