|
|
 |
RE: FN-FORUM: MySql Escaping -
date posted 17th October 2006 13:11
> -----Original Message-----
> From: Tony Crockford [EMAIL REMOVED]
> Sent: 17 October 2006 12:28
> To: Charles E. Gatenby
> Subject: Re: FN-FORUM: MySql Escaping -
>
>
> Charles E.Gatenby wrote:
> > Dam.. I have access to neither... :( But thanks any way. I
> think the
> > only thing for me to do is replace the - with some other
> character and
> > then replace it back when it gets displayed..
>
> you mean when you're storing it and retrieving it and setting
> up the search?
>
> are you sure you want a fulltext search type? seems to me
> that it's a fuzzy match (natural language matching) at best
> and you seem to be looking for exact word matching? is there
> a reason you're using FULLTEXT?
Yes, defiantly need fulltext.
>
> that would make sense, but check the mysql manual for the
> other delimiters first...
>
> I thought you might get away with just losing it, but spaces
> are also word delimiters and short words aren't included in
> the search anyway
> AIUI: so you'd end up looking for "weston mare" (BTW isn't
> it super not
> sub?)
Indeed it is weston-super-mare, although I wasn't using that in my test,
which is why Ben's solution didn't work.... D'oh.
>
> The FULLTEXT parser determines where words start and end by
> looking for certain delimiter characters; for example, ' '
> (space), ',' (comma), and '.' (period). If words are not
> separated by delimiters (as in, for example, Chinese), the
> FULLTEXT parser cannot determine where a word begins or ends.
> To be able to add words or other indexed terms in such
> languages to a FULLTEXT index, you must preprocess them so
> that they are separated by some arbitrary delimiter such as '"'.
>
> Some words are ignored in full-text searches:
> Any word that is too short is ignored. The default minimum
> length of words that are found by full-text searches is four
> characters.
And hence why my test for "hay-on-wye" return nothing..
>
> Words in the stopword list are ignored. A stopword is a word
> such as "the" or "some" that is so common that it is
> considered to have zero semantic value. There is a built-in
> stopword list, but it can be overwritten by a user-defined list.
>
>
Time reduce the char limit to 2 me thinks...
|
 |
|