Ticket #210 (reopened defect)

Opened 3 years ago

Last modified 3 years ago

Blog (folder) titles in non-English UTF-8 are inaccessible

Reported by: artemfrolov@… Owned by: anonymous
Priority: normal Milestone: Gregarius 0.5.5
Component: BUGS Version:
Severity: normal Keywords: Unicode utf-8 mod_rewrite titles
Cc:

Description

This particular feed has russian title in UTF-8

http://www.livejournal.com/users/eprilo/data/rss

This title is translated to series of underscores, so my link in gregarius for this blog looks like http://www.artemfrolov.com/rss/__________/ and clicking on it drops me to some seemingly random (though always the same) blog folder.

Even if it worked right and showed me the blog, this translation scheme would result in lots of collisions for UTF-8 names (i.e. 2 completely different titles translate to 10 underscores).

Just another wild observation, I have not checked it myself yet, though. What if blog name is 'admin' or 'update.php' in that case blog URL will be an internal Gregarius URL, and either blog or management function (?) might be inaccessible.

Change History

Changed 3 years ago by anonymous

  • owner changed from mbonetti to anonymous
  • status changed from new to assigned
  • milestone set to Gregarius 0.5.2

Oops: we did the actual item titles and forgot the feed titles.

Changed 3 years ago by anonymous

yes, I'm seeing the same problem for feeds in traditional Chinese as well see http:// edmondhui dot homeip dot net alash gregarius

Changed 3 years ago by anonymous

[952] fixes this. I hope it won't break other feeds.

Changed 3 years ago by mbonetti

  • status changed from assigned to closed
  • resolution set to fixed

Resolving as fixed (by [952]) for the time being.

Changed 3 years ago by anonymous

quote: "Just another wild observation, I have not checked it myself yet, though. What if blog name is 'admin' or 'update.php' in that case blog URL will be an internal Gregarius URL, and either blog or management function (?) might be inaccessible."

I encounter exactly this problem on my setup... 3 of my feeds title "My Blogs" and they have the same link....

URL is http://edmondhui.homeip.net/gregarius/ it's just a test so feel free to mess with it.

Changed 3 years ago by anonymous

hum... a chinese feed has wierd/incorrect title encoding, the url for the feed http://edmondhui.homeip.net/gregarius/#22826;_#34395;_#24187;_#22659;/

Changed 3 years ago by anonymous

  • status changed from closed to reopened
  • resolution fixed deleted

Changed 3 years ago by anonymous

Actually, to clearify my eariler post, I have 3 feeds with the same title and the url to display the item for the item is the same http://edmondhui.homeip.net/gregarius/My_Blog/ for all 3 links.... not exactly as quoted.....

Changed 3 years ago by anonymous

forgot to mention, my email is ymehui at y a h o o dot com

Changed 3 years ago by anonymous

  • milestone changed from Gregarius 0.5.2 to Gregarius 0.5.3

Changed 3 years ago by ymehui at yahoo com

I am using the latest SVN code and noticed that the link to show post by month and by date is displaying the utf-8 correct w/ link like http://edmondhui dot homeip dot net/gregarius/_/2005/11/09/

Changed 3 years ago by mdodoo

  • status changed from reopened to closed
  • resolution set to fixed

As far as I can tell, the original problem with feeds' titles not displaying properly in UTF-8 URLs has been fixed, hence I am closing this ticket.

ymehui, your current problem is different that the one that this ticket was created for. Normally, I would advise you to create a new ticket, except that I am certain that there was a previous ticket describing the same problem. I could not immediately find it, but feel free to contact me if you have trouble finding it. Basically, you have two ways to solve the "My_Blog" problem:

  • rename two of the feeds' title in Gregarius to something else
  • stop using mod_rewrite

Changed 3 years ago by ymehui at yahoo com

thanks. I agree "rename two of the feeds' title in Gregarius to something else" is the work around for now. But I think in the long run the way the mod_write work might need to change as I see that there are no identifier in the URL for accessing tag/feed/etc. I already encounter another one similar problem (see: http://svn.gregarius.net/trac/ticket/256")

ie http://edmondhui dot homeip dot net/gregarius/show_by_date/blahblah_title/2005/11/09/

http://edmondhui dot homeip dot net/gregarius/tag/blog

cheers,

Changed 3 years ago by anonymous

FYI, http://svn.gregarius.net/trac/ticket/256 is opened for the 3 "MyBlog?" links problem.

Changed 3 years ago by anonymous

  • status changed from closed to reopened
  • resolution fixed deleted

I installed the latest nightly but the "dirified" titles are still munged for the rewritten permalinks (right-side column in default theme).

Changed 3 years ago by anonymous

  • summary changed from Blog titles in non-English UTF-8 are inaccessible to Blog (folder) titles in non-English UTF-8 are inaccessible

To fix the problem with the folder url, the same utf8_uri_encode treatment for FeedListItem? needs to be applied to FeedFolder?, as well.

Changed 3 years ago by anonymous

  • keywords Unicode utf-8 mod_rewrite titles added

Similar problem with author (e.g.)

http://site.net/rss/author/

Changed 3 years ago by anonymous

Same design problem with the Feed function in items.php

Note: See TracTickets for help on using tickets.