<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.kde.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.6"
          urlbase="https://bugs.kde.org/"
          
          maintainer="sysadmin@kde.org"
>

    <bug>
          <bug_id>259318</bug_id>
          
          <creation_ts>2010-12-09 13:01:29 +0000</creation_ts>
          <short_desc>Dolphin&apos;s Nepomuk search doesn&apos;t handle accents properly</short_desc>
          <delta_ts>2011-09-28 07:19:07 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>10</classification_id>
          <classification>Unmaintained</classification>
          <product>nepomuk</product>
          <component>general</component>
          <version>4.1</version>
          <rep_platform>Ubuntu</rep_platform>
          <op_sys>Linux</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>UPSTREAM</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>NOR</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>0</everconfirmed>
          <reporter name="Alvaro Manuel Recio Perez">amrecio</reporter>
          <assigned_to name="Sebastian Trueg">sebastian</assigned_to>
          <cc>alexvpetrov</cc>
    
    <cc>jens</cc>
    
    <cc>kde</cc>
    
    <cc>trueg</cc>
          
          <cf_commitlink></cf_commitlink>
          <cf_versionfixedin></cf_versionfixedin>
          <cf_sentryurl></cf_sentryurl>
          <votes>20</votes>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>1055800</commentid>
    <comment_count>0</comment_count>
    <who name="Alvaro Manuel Recio Perez">amrecio</who>
    <bug_when>2010-12-09 13:01:29 +0000</bug_when>
    <thetext>Version:           4.1 (using KDE 4.5.85) 
OS:                Linux

First of all, I don&apos;t really know if this bug belongs to Nepomuk, Strigi or Dolphin.

Mi name is Álvaro (with an accented A) and I have a lot of documents indexed by Nepomuk (actually I guess Strigi indexed them) with my name in their contents. If I try to search for &quot;Álvaro&quot; (with an accent), I get no results. If I search for &quot;Alvaro&quot;, I get the documents that contain either &quot;Alvaro&quot; or &quot;Álvaro&quot;.

I&apos;ve tried to do the same with other accented words and the effect is the same.

Reproducible: Always

Steps to Reproduce:
1. Index with Nepomuk a document containing an accented word.
2. Open Dolphin and try to search for that word using the search bar.

Actual Results:  
No results are shown.

Expected Results:  
Documents containing the word should be shown to the user.

OS: Linux (x86_64) release 2.6.35-23-generic
Compiler: cc</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1071275</commentid>
    <comment_count>1</comment_count>
    <who name="Jens Bergqvist">jens</who>
    <bug_when>2011-01-10 17:38:28 +0000</bug_when>
    <thetext>I have a similar situation in 4.5.95 (4.6 RC2) on Kubuntu 10.10 (32 and 64 bit), only I get no search results all for accented characters. In my case nepomuk/strigi does not associate e.g. á with a or ö with o, which seems to be what happens for the original reporter.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1075684</commentid>
    <comment_count>2</comment_count>
    <who name="Sebastian Trueg">trueg</who>
    <bug_when>2011-01-19 20:15:57 +0000</bug_when>
    <thetext>The next version of Virtuoso will contain a new configuration parameter that normalizes accents for full text queries.
I already added support for that configuration to Nepomuk. Thus, it will be used as soon as the new Virtuoso is installed.
However, only newly added text is affected. I will experiment with updating though.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1087666</commentid>
    <comment_count>3</comment_count>
    <who name="Sebastian Trueg">trueg</who>
    <bug_when>2011-02-14 14:25:31 +0000</bug_when>
    <thetext>*** Bug 266294 has been marked as a duplicate of this bug. ***</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1087669</commentid>
    <comment_count>4</comment_count>
    <who name="Ignacio Serantes">kde</who>
    <bug_when>2011-02-14 14:34:53 +0000</bug_when>
    <thetext>When the next version of Virtuoso will be available?

On the other side, queries in KDE 4.5 works well with unicode characters, in my case I use many Corean and Japanese characters and result was accurate so I wonder if this could be considered as a virtuoso problem.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1167767</commentid>
    <comment_count>5</comment_count>
    <who name="Sebastian Trueg">trueg</who>
    <bug_when>2011-09-28 07:19:07 +0000</bug_when>
    <thetext>*** Bug 282950 has been marked as a duplicate of this bug. ***</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>