This is a multi-part message in MIME format.
------_=_NextPart_001_01C89F80.02E09E4F
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Hi,
In a recent linux magazine article (http://www.linux-mag.com/id/5679)
there was a mentioning of Full-Text Search Integration. Which I know
nothing about, but sounded interesting to me. You might want to
check it out.
Regards,
Tena Sakai
tsakai@[EMAIL PROTECTED]
Message-----
From: pgsql-sql-owner@[EMAIL PROTECTED]
on behalf of Andreas
Sent: Tue 4/15/2008 8:15 PM
To: pgsql-sql@[EMAIL PROTECTED]
[SQL] How to find double entries
=20
Hi,
how can I find double entries in varchar columns where the content is=20
not 100% identical because of a spelling error or the person considered=20
it "looked nicer" that way?
I'd like to identify and then merge records of e.g. 'google', 'gogle', =
'guugle'=20
Then I want to match abbrevations like 'A-Company Ltd.', 'a company=20
ltd.', 'A-Company Limited'
Is there a way to do this?
It would be OK just to list candidats up to be manually checked =
afterwards.
Regards
Andreas
--=20
Sent via pgsql-sql mailing list (pgsql-sql@[EMAIL PROTECTED]
)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-sql
------_=_NextPart_001_01C89F80.02E09E4F
Content-Type: text/html;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Diso-8859-1">
<META NAME=3D"Generator" CONTENT=3D"MS Exchange Server version =
6.5.7651.59">
<TITLE>RE: [SQL] How to find double entries</TITLE>
</HEAD>
<BODY>
<!-- Converted from text/plain format -->
<P><FONT SIZE=3D2>Hi,<BR>
<BR>
In a recent linux magazine article (<A =
HREF=3D"http://www.linux-mag.com/id/5679">http://www.linux-mag.com/id/567=
9</A>)<BR>
there was a mentioning of Full-Text Search Integration. Which I =
know<BR>
nothing about, but sounded interesting to me. You might want =
to<BR>
check it out.<BR>
<BR>
Regards,<BR>
<BR>
Tena Sakai<BR>
tsakai@[EMAIL PROTECTED]
>
<BR>
<BR>
-----Original Message-----<BR>
From: pgsql-sql-owner@[EMAIL PROTECTED]
on behalf of Andreas<BR>
Sent: Tue 4/15/2008 8:15 PM<BR>
To: pgsql-sql@[EMAIL PROTECTED]
>
Subject: [SQL] How to find double entries<BR>
<BR>
Hi,<BR>
<BR>
how can I find double entries in varchar columns where the content =
is<BR>
not 100% identical because of a spelling error or the person =
considered<BR>
it "looked nicer" that way?<BR>
<BR>
I'd like to identify and then merge records of e.g. =
'google', 'gogle',<BR>
'guugle'<BR>
<BR>
Then I want to match abbrevations like 'A-Company Ltd.', 'a =
company<BR>
ltd.', 'A-Company Limited'<BR>
<BR>
Is there a way to do this?<BR>
It would be OK just to list candidats up to be manually checked =
afterwards.<BR>
<BR>
<BR>
Regards<BR>
Andreas<BR>
<BR>
--<BR>
Sent via pgsql-sql mailing list (pgsql-sql@[EMAIL PROTECTED]
)<BR>
To make changes to your subscription:<BR>
<A =
HREF=3D"http://www.postgresql.org/mailpref/pgsql-sql">http://www.postgres=
ql.org/mailpref/pgsql-sql</A><BR>
<BR>
</FONT>
</P>
</BODY>
</HTML>
------_=_NextPart_001_01C89F80.02E09E4F--


|