none
Foreign Characters in Headings Fool TOC RRS feed

  • Question

  • In the past we have found that zero characters, "0", in headings can fool the Table of Contents feature. The problem isn't with the heading itself, but the <a name> tag in the HTML used by the TOC. The fix is to replace "0" characters in the <a name> tags in the HTML editor with something else. Otherwise the heading will not appear in the TOC.

    I just found that the same problem happens if the heading has certain foreign (non-English) characters. I found 5 headings in the following article that did not appear in the TOC:

    http://social.technet.microsoft.com/wiki/pt-br/contents/articles/15942.asp-net-mvc.aspx

    The characters that cause the problem (that I have found so far) are:

    é í ç ã

    I fixed the article above by replacing these characters in the <a name> tags in the HTML editor with the following: e i c a. This probably makes no sense in Portuguese, but now the TOC works. I updated the following Wiki article to describe the problem (Problem 2):

    http://social.technet.microsoft.com/wiki/contents/articles/12687.tips-tricks-to-fix-the-wiki-article-toc-table-of-contents.aspx

    Has anyone encountered this before? Do all non-english characters cause this problem? Lots of non-English articles have lots of non-English characters.


    Richard Mueller - MVP Directory Services

    Friday, February 15, 2013 9:13 PM
    Moderator

Answers

All replies

  • A header line in the first article I linked above (# 15942) appeared as follows in the HTML editor before I fixed it.

    <h3><a name="Instalação"></a><a name="Instalação"></a><a name="Instalação"></a>Instalação</h3>

    -----

    The <a name> tag is repeated each time the article is saved, because it is not recognized as valid, and the heading never showed in the TOC. I fixed this line in the HTML editor as follows:

    <h3><a name="Instalacao"></a>Instalação</h3>

    -----

    I removed the duplicates and replaced "çã" with "ca". However, now I find the following Wiki article with similar characters in the headings:

    http://social.technet.microsoft.com/wiki/contents/articles/6273.guia-de-sobrevivencia-system-center-orchestrator-2012-pt-br.aspx

    Here is a heading line from the article as it appears in the HTML editor:

    <h1><a name="Integra_ccedil_atilde_o"></a>Integração</h1>

    -----

    This line appears in the TOC, as there are no non-English characters in the <a name> tag. But what process substituted "_ccedil_atilde_" for "çã"? I know it wasn't the Wiki editor.


    Richard Mueller - MVP Directory Services

    Saturday, February 16, 2013 2:42 AM
    Moderator
  • ccedil is the character's name :) cedil on the c you could tell it in english.

    I confirm it's not the wiki editor that wrote that

    The wiki should save it that way, maybe we should create a bug request for that. The [toc] function/script seem bugged for international's characters.


    MCP | MCTS 70-236: Exchange Server 2007, Configuring

    Twitter - @yagmoth555 ()
    Blog: http://www.jabea.net | http://blogs.technet.com/b/wikininjas/


    • Edited by Yagmoth555MVP Saturday, February 16, 2013 4:44 AM removed the ie bug, as on 3 tests it worked the last test
    Saturday, February 16, 2013 4:35 AM
  • Ah yes. I saw you fixing it and started fixing some other articles as well.

    I submitted this as a bug request.

    Thanks!


    Ed Price (a.k.a User Ed), SQL Server Customer Program Manager (Blog, Small Basic, Wiki Ninjas, Wiki)

    Answer an interesting question? Create a wiki article about it!

    Saturday, February 16, 2013 7:28 AM
    Owner