I would like to know if I can change the location of a noise file on a
per-database (or filegroup) level?
My situation is this:
I have two databases operating on the same machine, same language, same
instance, different file groups. I have one application (using one of the
databases) that has a list of words that I want FTS to ignore when
searching. I have another application (using the othe database) theat not
only should include some of those words in its searching, but has its own
list of words to ignore.
Is what I want to do possible?
Any help is appreciated
Thanks in advance.
WALDO
Waldo,
While it is not possible to move or have duplicate noise word files in SQL
Server 2000 (post SELECT @.@.version output), you may be able to achieve your
goals via using US_English (noise.enu) for one database's FT-enabled tables
and UK_English for another database's FT-enabled table as both noise word
files are both English based.
You can reference different languages via "Language for Word Breaker" and
therefore different noise word files for each table. I
believe I posted an answer to your previous and differently worded question
as:
For SQL Server 2000, you could define a table in DATABASE A to use "British
English", i.e, the noise.eng noise word file, and add your custom noise
words here and when you create the FT Catalog, you define the "British
English" for that column. While, in DATABASE B, you would use US English and
noise.enu. This should meet your requirements
FYI, the noise word file locations are controlled via registry keys:
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\ContentIndex Common\LanguageResources\O
verride\SQLServer\English (United States)
NoiseFile value= F:\MSSQL80\MSSQL\FTData\SQLServer\Config\noise.enu
Hope that helps!
John
SQL Full Text Search Blog
http://spaces.msn.com/members/jtkane/
"Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
news:#r4rpfDnFHA.860@.TK2MSFTNGP12.phx.gbl...
> I would like to know if I can change the location of a noise file on a
> per-database (or filegroup) level?
> My situation is this:
> I have two databases operating on the same machine, same language,
same
> instance, different file groups. I have one application (using one of the
> databases) that has a list of words that I want FTS to ignore when
> searching. I have another application (using the othe database) theat not
> only should include some of those words in its searching, but has its own
> list of words to ignore.
> Is what I want to do possible?
> Any help is appreciated
> Thanks in advance.
> WALDO
>
|||You can use the US English word breaker for one, and the UK English word
breaker for another. This should work for you are you appear to be in PA.
The UK and US English word lists, breakers and stemmers are almost
identical.
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
news:%23r4rpfDnFHA.860@.TK2MSFTNGP12.phx.gbl...
> I would like to know if I can change the location of a noise file on a
> per-database (or filegroup) level?
> My situation is this:
> I have two databases operating on the same machine, same language,
same
> instance, different file groups. I have one application (using one of the
> databases) that has a list of words that I want FTS to ignore when
> searching. I have another application (using the othe database) theat not
> only should include some of those words in its searching, but has its own
> list of words to ignore.
> Is what I want to do possible?
> Any help is appreciated
> Thanks in advance.
> WALDO
>
|||How did you know that I was in PA?
"Hilary Cotter" <hilary.cotter@.gmail.com> wrote in message
news:eO64FpNnFHA.1968@.TK2MSFTNGP14.phx.gbl...[vbcol=seagreen]
> You can use the US English word breaker for one, and the UK English word
> breaker for another. This should work for you are you appear to be in PA.
> The UK and US English word lists, breakers and stemmers are almost
> identical.
> --
> Hilary Cotter
> Looking for a SQL Server replication book?
> http://www.nwsu.com/0974973602.html
> Looking for a FAQ on Indexing Services/SQL FTS
> http://www.indexserverfaq.com
> "Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
> news:%23r4rpfDnFHA.860@.TK2MSFTNGP12.phx.gbl...
> same
the[vbcol=seagreen]
not[vbcol=seagreen]
own
>
|||Wow, just the 2 people I wanted to reply to this.
SELECT @.@.version:
Microsoft SQL Server 2000 - 8.00.760 (Intel X86)
Dec 17 2002 14:22:05
Copyright (c) 1988-2003 Microsoft Corporation
Enterprise Edition on Windows NT 5.0 (Build 2195: Service Pack 4)
I was actually debating using a different laguage word-breaker/stemmer for
each database. Is it possible to create my own word-breaker/stemmer COM
classes as long as they implement the correct interfaces?
For instance, have my own "language" with a fictional locale id, create the
registry entries for my language, then supply a "noise.myLang1" file for the
noise words.
Is that possible?
"John Kane" <jt-kane@.comcast.net> wrote in message
news:u5pf6UInFHA.1968@.TK2MSFTNGP14.phx.gbl...
> Waldo,
> While it is not possible to move or have duplicate noise word files in SQL
> Server 2000 (post SELECT @.@.version output), you may be able to achieve
your
> goals via using US_English (noise.enu) for one database's FT-enabled
tables
> and UK_English for another database's FT-enabled table as both noise word
> files are both English based.
> You can reference different languages via "Language for Word Breaker" and
> therefore different noise word files for each table. I
> believe I posted an answer to your previous and differently worded
question
> as:
> For SQL Server 2000, you could define a table in DATABASE A to use
"British
> English", i.e, the noise.eng noise word file, and add your custom noise
> words here and when you create the FT Catalog, you define the "British
> English" for that column. While, in DATABASE B, you would use US English
and
> noise.enu. This should meet your requirements
> FYI, the noise word file locations are controlled via registry keys:
>
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\ContentIndex Common\LanguageResources\O[vbcol=seagreen]
> verride\SQLServer\English (United States)
> NoiseFile value= F:\MSSQL80\MSSQL\FTData\SQLServer\Config\noise.enu
> Hope that helps!
> John
> --
> SQL Full Text Search Blog
> http://spaces.msn.com/members/jtkane/
>
> "Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
> news:#r4rpfDnFHA.860@.TK2MSFTNGP12.phx.gbl...
> same
the[vbcol=seagreen]
not[vbcol=seagreen]
own
>
|||There sheer genius of your post make it unmistakably someone from PA. That
and your nntp headers
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
news:%23S1IQVQnFHA.2580@.TK2MSFTNGP09.phx.gbl...[vbcol=seagreen]
> How did you know that I was in PA?
> "Hilary Cotter" <hilary.cotter@.gmail.com> wrote in message
> news:eO64FpNnFHA.1968@.TK2MSFTNGP14.phx.gbl...
PA.
> the
> not
> own
>
|||Little off topic, (ok very)
Just out of sheer morbid curiosity, what in my NNTP headers indicates PA? I
would be interested in knowing just for my own personal gratification. This
is what mine says when I look at them.
From: "Rob Mayo" <NOSPAM@.NOSPAM.com>
Subject: Can I specify location of noise file?
Date: Mon, 8 Aug 2005 12:40:38 -0400
Lines: 19
X-Priority: 3
X-MSMail-Priority: Normal
X-Newsreader: Microsoft Outlook Express 6.00.2800.1506
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1506
Message-ID: <#r4rpfDnFHA.860@.TK2MSFTNGP12.phx.gbl>
Newsgroups: microsoft.public.sqlserver.fulltext
NNTP-Posting-Host: nat.westonsolutions.com 162.43.198.100
Path: TK2MSFTNGP08.phx.gbl!TK2MSFTNGP12.phx.gbl
Xref: TK2MSFTNGP08.phx.gbl microsoft.public.sqlserver.fulltext:15271
I see in yours that they came from Toronto
"Hilary Cotter" <hilary.cotter@.gmail.com> wrote in message
news:%235nffZQnFHA.2180@.TK2MSFTNGP15.phx.gbl...[vbcol=seagreen]
> There sheer genius of your post make it unmistakably someone from PA. That
> and your nntp headers
> --
> Hilary Cotter
> Looking for a SQL Server replication book?
> http://www.nwsu.com/0974973602.html
> Looking for a FAQ on Indexing Services/SQL FTS
> http://www.indexserverfaq.com
> "Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
> news:%23S1IQVQnFHA.2580@.TK2MSFTNGP09.phx.gbl...
word[vbcol=seagreen]
> PA.
a[vbcol=seagreen]
language,[vbcol=seagreen]
of[vbcol=seagreen]
theat[vbcol=seagreen]
its
>
|||I did a tracert on your IP
C:\Documents and Settings\hcotter>tracert 162.43.198.100
Tracing route to nat.westonsolutions.com [162.43.198.100]
over a maximum of 30 hops:
1 6 ms 6 ms 8 ms 172.16.1.1
2 20 ms 40 ms 16 ms 64.230.197.195
3 49 ms 41 ms 22 ms 64.230.234.37
4 15 ms 15 ms 13 ms 64.230.221.105
5 31 ms 63 ms 62 ms core2-chicago23-pos10-0.in.bellnexxia.net
[206.1
08.103.118]
6 57 ms 30 ms 25 ms bx2-chicago23-pos10-0.in.bellnexxia.net
[206.108
..103.122]
7 43 ms 58 ms 35 ms sl-gw36-chi-12-0.sprintlink.net
[160.81.109.193]
8 25 ms 36 ms 34 ms sl-bb20-chi-5-0.sprintlink.net
[144.232.26.69]
9 25 ms 27 ms 96 ms sl-bb25-chi-8-0.sprintlink.net
[144.232.26.113]
10 46 ms 45 ms 58 ms sl-bb26-rly-10-0.sprintlink.net
[144.232.20.88]
11 47 ms 68 ms 79 ms sl-bb24-pen-12-0.sprintlink.net
[144.232.20.110]
12 99 ms 74 ms 69 ms sl-bb20-pen-8-0.sprintlink.net
[144.232.5.165]
13 83 ms 54 ms 73 ms sl-gw2-pen-9-0.sprintlink.net [144.232.5.10]
14 47 ms 45 ms 47 ms sl-internap-121-0.sprintlink.net
[144.223.16.234
]
15 52 ms 45 ms 45 ms border1.fe0-0-bbnet1.phi.pnap.net
[216.52.64.4]
16 * rfweston-4.border1.phi.pnap.net [216.52.66.38] reports:
Destinati
on net unreachable.
Trace complete.
I notice a lot of references to pen and phi in there.
I am actually based out of NJ, I am at a client site north of Toronto.
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
news:On3aS6QnFHA.2916@.TK2MSFTNGP14.phx.gbl...
> Little off topic, (ok very)
> Just out of sheer morbid curiosity, what in my NNTP headers indicates PA?
I
> would be interested in knowing just for my own personal gratification.
This[vbcol=seagreen]
> is what mine says when I look at them.
> From: "Rob Mayo" <NOSPAM@.NOSPAM.com>
> Subject: Can I specify location of noise file?
> Date: Mon, 8 Aug 2005 12:40:38 -0400
> Lines: 19
> X-Priority: 3
> X-MSMail-Priority: Normal
> X-Newsreader: Microsoft Outlook Express 6.00.2800.1506
> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1506
> Message-ID: <#r4rpfDnFHA.860@.TK2MSFTNGP12.phx.gbl>
> Newsgroups: microsoft.public.sqlserver.fulltext
> NNTP-Posting-Host: nat.westonsolutions.com 162.43.198.100
> Path: TK2MSFTNGP08.phx.gbl!TK2MSFTNGP12.phx.gbl
> Xref: TK2MSFTNGP08.phx.gbl microsoft.public.sqlserver.fulltext:15271
> I see in yours that they came from Toronto
>
> "Hilary Cotter" <hilary.cotter@.gmail.com> wrote in message
> news:%235nffZQnFHA.2180@.TK2MSFTNGP15.phx.gbl...
That[vbcol=seagreen]
> word
in[vbcol=seagreen]
on
> a
> language,
> of
> theat
> its
>
|||You're welcome, Rob,
First of all, thank you for the @.@.version info as this tell me that you're
using SQL 2000 SP3 on Win2K SP4. The OS-platform info is important as with
SQL 2000, the OS-supplied English wordbreaker is (in this case)
infosoft.dll.
As for having your own "fictional" language and fictional locale id with
corresponding registry keys, etc., that may be possible as that is how the
IFilter sample apps are setup to demonstrate the use of the interfaces. The
following links should be helpful in that regard:
Word Breaker and Stemmer Sample
http://msdn.microsoft.com/library/de...nario_3e91.asp
Implementing a Stemmer
http://msdn.microsoft.com/library/de...nario_1bg3.asp
Implementing a Word Breaker
http://msdn.microsoft.com/library/de...nario_54bp.asp
About Language Resources
http://msdn.microsoft.com/library/de...nario_3u2c.asp
Troubleshooting Language Resources
http://msdn.microsoft.com/library/de...nario_5dwz.asp
As for integrating the fictional "language" into SQL Server FTS in a real
production environment that uses "English" as the actual text language, I'm
not sure that is possible. However, feel free to ping me directly if you
want to discuss this in more detail.
Thanks,
John
SQL Full Text Search Blog
http://spaces.msn.com/members/jtkane/
"Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
news:Ovl4yYQnFHA.4064@.TK2MSFTNGP10.phx.gbl...
> Wow, just the 2 people I wanted to reply to this.
> SELECT @.@.version:
> Microsoft SQL Server 2000 - 8.00.760 (Intel X86)
> Dec 17 2002 14:22:05
> Copyright (c) 1988-2003 Microsoft Corporation
> Enterprise Edition on Windows NT 5.0 (Build 2195: Service Pack 4)
> I was actually debating using a different laguage word-breaker/stemmer for
> each database. Is it possible to create my own word-breaker/stemmer COM
> classes as long as they implement the correct interfaces?
> For instance, have my own "language" with a fictional locale id, create
the
> registry entries for my language, then supply a "noise.myLang1" file for
the[vbcol=seagreen]
> noise words.
> Is that possible?
>
> "John Kane" <jt-kane@.comcast.net> wrote in message
> news:u5pf6UInFHA.1968@.TK2MSFTNGP14.phx.gbl...
SQL[vbcol=seagreen]
> your
> tables
word[vbcol=seagreen]
and
> question
> "British
> and
>
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\ContentIndex Common\LanguageResources\O
> the
> not
> own
>
|||Damn, cool
"Hilary Cotter" <hilary.cotter@.gmail.com> wrote in message
news:eh%23OlxRnFHA.3288@.TK2MSFTNGP10.phx.gbl...
> I did a tracert on your IP
> C:\Documents and Settings\hcotter>tracert 162.43.198.100
> Tracing route to nat.westonsolutions.com [162.43.198.100]
> over a maximum of 30 hops:
> 1 6 ms 6 ms 8 ms 172.16.1.1
> 2 20 ms 40 ms 16 ms 64.230.197.195
> 3 49 ms 41 ms 22 ms 64.230.234.37
> 4 15 ms 15 ms 13 ms 64.230.221.105
> 5 31 ms 63 ms 62 ms core2-chicago23-pos10-0.in.bellnexxia.net
> [206.1
> 08.103.118]
> 6 57 ms 30 ms 25 ms bx2-chicago23-pos10-0.in.bellnexxia.net
> [206.108
> .103.122]
> 7 43 ms 58 ms 35 ms sl-gw36-chi-12-0.sprintlink.net
> [160.81.109.193]
> 8 25 ms 36 ms 34 ms sl-bb20-chi-5-0.sprintlink.net
> [144.232.26.69]
> 9 25 ms 27 ms 96 ms sl-bb25-chi-8-0.sprintlink.net
> [144.232.26.113]
> 10 46 ms 45 ms 58 ms sl-bb26-rly-10-0.sprintlink.net
> [144.232.20.88]
> 11 47 ms 68 ms 79 ms sl-bb24-pen-12-0.sprintlink.net
> [144.232.20.110]
> 12 99 ms 74 ms 69 ms sl-bb20-pen-8-0.sprintlink.net
> [144.232.5.165]
> 13 83 ms 54 ms 73 ms sl-gw2-pen-9-0.sprintlink.net
[144.232.5.10][vbcol=seagreen]
> 14 47 ms 45 ms 47 ms sl-internap-121-0.sprintlink.net
> [144.223.16.234
> ]
> 15 52 ms 45 ms 45 ms border1.fe0-0-bbnet1.phi.pnap.net
> [216.52.64.4]
> 16 * rfweston-4.border1.phi.pnap.net [216.52.66.38] reports:
> Destinati
> on net unreachable.
> Trace complete.
> I notice a lot of references to pen and phi in there.
> I am actually based out of NJ, I am at a client site north of Toronto.
> --
> Hilary Cotter
> Looking for a SQL Server replication book?
> http://www.nwsu.com/0974973602.html
> Looking for a FAQ on Indexing Services/SQL FTS
> http://www.indexserverfaq.com
> "Rob Mayo" <NOSPAM@.NOSPAM.com> wrote in message
> news:On3aS6QnFHA.2916@.TK2MSFTNGP14.phx.gbl...
PA?[vbcol=seagreen]
> I
> This
> That
English[vbcol=seagreen]
> in
file[vbcol=seagreen]
> on
one[vbcol=seagreen]
when[vbcol=seagreen]
has
>
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment