2012年2月19日星期日

FTS does not index htm file in image column

Hi,
I have FT indexing set up on an image column, the column is binded to a
document type column and the FT indexing works fine. My problem is that
when a html file has the following utf-8 metatag
<META http-equiv=Content-Type content="text/html; charset=utf-16">
then the FT searches do not work on that particular file at all. Does
anyone know why this should be the case? Is there a way of making FT
index these files?
thanks in advance,
Paul
what is the value of your document type column? Are there any errors in your
gatherer log?
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
<paul.craigie@.gmail.com> wrote in message
news:1105972135.333652.164980@.f14g2000cwb.googlegr oups.com...
> Hi,
> I have FT indexing set up on an image column, the column is binded to a
> document type column and the FT indexing works fine. My problem is that
> when a html file has the following utf-8 metatag
> <META http-equiv=Content-Type content="text/html; charset=utf-16">
> then the FT searches do not work on that particular file at all. Does
> anyone know why this should be the case? Is there a way of making FT
> index these files?
> thanks in advance,
> Paul
>
|||thanks for prompt response.
the value of the documenttype column for a html file that does not get
returned by FTS is corpUpdate3.htm. For comparison, the documenttype
value for another record which does get returned (i.e. does not have
the utf-8 meta tag) is corpUpdate4.htm . I'm not sure what you mean by
gatherer log, I have checked the event viewer and there's nothing there
from MS Search or msci.
Paul
|||I think you will need to add the MS.Locale language tag for this to work.
Let me try to repro it.
Also, the DocumentType column contents should be htm, html, .htm, or .html
<paul.craigie@.gmail.com> wrote in message
news:1105974655.007897.288270@.z14g2000cwz.googlegr oups.com...
> thanks for prompt response.
> the value of the documenttype column for a html file that does not get
> returned by FTS is corpUpdate3.htm. For comparison, the documenttype
> value for another record which does get returned (i.e. does not have
> the utf-8 meta tag) is corpUpdate4.htm . I'm not sure what you mean by
> gatherer log, I have checked the event viewer and there's nothing there
> from MS Search or msci.
> Paul
>
|||Hi Hilary,
I've worked out where the gatherer log files are. There are lots of
this kind of error in there ..
f28c7c9b1c4fcb0MSSQL75://SQLServer/3805392f/000000638000001c080040e21Multiple-step
OLE DB operation generated errors. Check each OLE DB status value, if
available. No work was done.
does this tell us anything?
Paul
|||It could. Could you possibly post the html file here?
Also is there anything non-standard about your machine? Do you have
Sharepoint, Yukon, Exchange running on this machine? Is this a cluster?
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
<paul.craigie@.gmail.com> wrote in message
news:1105979877.237531.111230@.c13g2000cwb.googlegr oups.com...
> Hi Hilary,
> I've worked out where the gatherer log files are. There are lots of
> this kind of error in there ..
> f28c7c9b 1c4fcb0 MSSQL75://SQLServer/3805392f/00000063 8000001c 0 80040e21
Multiple-step
> OLE DB operation generated errors. Check each OLE DB status value, if
> available. No work was done.
> does this tell us anything?
> Paul
>
|||OK,
the first test file looks like this ..
<html>
<head><META http-equiv=Content-Type content="text/html;
charset=utf-8"></head>
<body>
<p>paultest2</p>
</body>
</html>
and the second test file looks like this
<html>
<head></head>
<body>
<p>paultest2</p>
</body>
</html>
when I do a search on 'paultest2', only the second file is returned.
The SQL server is on a dedicated DB server, there are no other apps
running on it. I tried the MS.Locale language tag but it hasn't made
any difference to the chances of a file being returned by FTS.
Paul

没有评论:

发表评论