Talk About Network

Google


Register and Login
Nick
Password
Register create new account Sign up is FREE and you can post replies, new topics, bookmark posts and more!
Recover lost password


Data Bases > Databases > Re: Novice neee...
Latest [ Topics | Posts ] Archive Post A New Topic Post a Reply
<< Topic < Post Post 2 of 4 Topic 354 of 385
Post > Topic >>

Re: Novice neeeds help: Database with large text fields

by Brian Inglis <Brian.Inglis@[EMAIL PROTECTED] > Oct 27, 2006 at 04:19 AM

On Thu, 26 Oct 2006 21:49:11 -0500 in alt.comp.databases, Ed Katzman
<none@[EMAIL PROTECTED]
> wrote:

>Hi.  I'm a complete novice in the area of text processing and am hoping 
>that someone can point me in a good direction to get started.
>
>Here is the problem: I work in consumer products marketing. I have a 
>database of over 100,000 products.  Each record is for the initial 
>introduction of a product into the market and it provides some basic 
>overview information.  While some of the information is arranged in 
>separate fields (date of intro, manufacturer, etc.) most of the valuable 
>information for our purpose is contained in a free form description 
>field.
>
>I am hoping to do some cluster analysis or even some cladistics on the 
>data, but it seems like I need to pull the relevant text information out 
>of the description field and put it in some group of individual fields
>
>I don't know where to start to process this data so it comes out more 
>structured as input to other uses.
>
>Can anyone give me some advice?

Most major database vendors have text search addons if you can afford
them. 
If you can't, generate an xref of the products with the words. 
Look at counts of words per product and overall. 
Eliminate frequently occurring "noise" words. 
Then look at correlations between product attributes and words,
concentrating initially on the most and least frequently occurring
words. 
That may give you some ideas on where to go next. 

-- 
Thanks. Take care, Brian Inglis 	Calgary, Alberta, Canada

Brian.Inglis@[EMAIL PROTECTED]
 	(Brian[dot]Inglis{at}SystematicSW[dot]ab[dot]ca)
    fake address		use address above to reply
 




 4 Posts in Topic:
Novice neeeds help: Database with large text fields
Ed Katzman <none@[EMAI  2006-10-26 21:49:11 
Re: Novice neeeds help: Database with large text fields
Brian Inglis <Brian.In  2006-10-27 04:19:32 
Re: Novice neeeds help: Database with large text fields
Ed Katzman <none@[EMAI  2006-10-27 18:30:59 
Re: Novice neeeds help: Database with large text fields
Brian Inglis <Brian.In  2006-10-28 17:39:21 

Post A Reply:
  Go here to Signup

AddThis Feed Button


About - Advertising - Contact - Frequently Asked Questions - Privacy Policy - Terms of Use - Signup

Contact
tan12V112 Thu Aug 21 23:22:59 CDT 2008.