Talk About Network

Google


Register and Login
Nick
Password
Register create new account Sign up is FREE and you can post replies, new topics, bookmark posts and more!
Recover lost password


Data Bases > Microsoft Access > Re: How to achi...
Latest [ Topics | Posts ] Archive Post A New Topic Post a Reply
<< Topic < Post Post 4 of 23 Topic 30626 of 31576
Post > Topic >>

Re: How to achieve scalability

by Tom van Stiphout <no.spam.tom7744@[EMAIL PROTECTED] > Jul 1, 2008 at 10:02 PM

On Wed, 2 Jul 2008 00:45:11 +0000 (UTC), "Carlos Nunes-Ueno"
<sullascl@[EMAIL PROTECTED]
> wrote:

Was that a real example? It seems difficult to come up with consistent
rules that convert "The Yummy and Tasty Waffle Cor****ation" into
"TASTYWAFFLE". Something like "Take the 4th and 5th word, and omit the
spaces"? Can you tell us what kinds of rules you're applying for the
conversion?

I have had good success with the Ratcliff/Obershelp algorithm that
returns a similarity (a number between 0 and 1) between two strings. I
checked and for your two company names the similarity is 0.75. Using
some cutoff value you can narrow down the most similar companies and
bunch them up that way.
We recently implemented this algorithm as a .Net assembly in SQL
Server 2005, and it is very fast. 10,000 comparisons in way less than
1 second.

I'm not at all convinced RegEx is the ticket here.

-Tom.


>I have a fairly large table (700,000 rows or so) that I'd like to run a 
>process.  However, the procedure we have right now was designed with 
>tables of more like 20,000 rows and isn't able to handle it.  Access will

>always crash before it can complete.
>
>Some background: the procedure is used in the process of data cleanup and

>is designed to process company names into a standardized form, so that we

>can use it to confirm data across various datasets.  The procedure takes 
>input like, for example, "The Yummy and Tasty Waffle Cor****ation" or 
>"Yummy & Tasty Waffle, Incor****ated" and turns both into "TASTYWAFFLE".  
>We can then sort, link, and filter, etc. on this field along with others 
>to see if there are duplicates or check if companies that have different 
>IDs are in fact the same company.
>
>Specifically, another procedure takes a specified table and field and 
>creates a new field, filling it with the contents of the original field. 
>That procedure then p***** a DAO recordset and the name of the new field 
>to the main procedure which then performs 11 operations to arrive at the 
>"TASTYWAFFLE" stage.  Filling the new field is clearly double work and 
>I'll be triming that part out.
>
>Now for my specific questions: Currently, the procedure takes the whole 
>contents of the field at once, and uses a lot of InStr, Mid, Left, and 
>Right functions to perform all of the operations, then moves on to the 
>next row.  It seems more direct to just read character by charater until 
>I have a complete word (i.e. I hit a space or other delimiter), process 
>that bit, then move on to the next part of the field.  Which of these 
>approaches is more efficient?  Also, where could I go to find some 
>guidelines on writing the most scalable VBA code?  I know Access has 
>limitations, but I'd like to be limited by those and not by our own 
>inefficiencies.
>
>Thanks in advance,
>
>Carlos
 




 23 Posts in Topic:
How to achieve scalability
"Carlos Nunes-Ueno&q  2008-07-02 00:45:11 
Re: How to achieve scalability
lyle fairfield <lyle.f  2008-07-01 18:28:36 
Re: How to achieve scalability
"Carlos Nunes-Ueno&q  2008-07-02 02:05:28 
Re: How to achieve scalability
Tom van Stiphout <no.s  2008-07-01 22:02:27 
Re: How to achieve scalability
"Carlos Nunes-Ueno&q  2008-07-02 15:59:43 
Re: How to achieve scalability
"Tony Toews [MVP]&qu  2008-07-03 02:34:49 
Re: How to achieve scalability
The Frog <Mr.Frog.to.y  2008-07-02 00:40:42 
Re: How to achieve scalability
Tom van Stiphout <no.s  2008-07-02 20:49:02 
Re: How to achieve scalability
"David W. Fenton&quo  2008-07-03 22:35:23 
Re: How to achieve scalability
The Frog <Mr.Frog.to.y  2008-07-02 01:35:21 
Re: How to achieve scalability
lyle fairfield <lyle.f  2008-07-02 03:06:22 
Re: How to achieve scalability
"Carlos Nunes-Ueno&q  2008-07-02 17:07:18 
Re: How to achieve scalability
Tom van Stiphout <no.s  2008-07-02 21:07:42 
Re: How to achieve scalability
lyle fairfield <lylefa  2008-07-03 16:25:34 
Re: How to achieve scalability
lyle fairfield <lylefa  2008-07-03 23:00:50 
Re: How to achieve scalability
Salad <oil@[EMAIL PROT  2008-07-02 10:36:14 
Re: How to achieve scalability
lyle fairfield <lylefa  2008-07-03 16:26:56 
Re: How to achieve scalability
The Frog <Mr.Frog.to.y  2008-07-03 00:11:22 
Re: How to achieve scalability
Salad <oil@[EMAIL PROT  2008-07-03 07:15:10 
Re: How to achieve scalability
lyle fairfield <lyle.f  2008-07-03 07:43:01 
Re: How to achieve scalability
The Frog <Mr.Frog.to.y  2008-07-04 00:32:24 
Re: How to achieve scalability
lyle fairfield <lylefa  2008-07-04 14:42:46 
Re: How to achieve scalability
The Frog <Mr.Frog.to.y  2008-07-07 03:19:39 

Post A Reply:
  Go here to Signup

AddThis Feed Button


About - Advertising - Contact - Frequently Asked Questions - Privacy Policy - Terms of Use - Signup

Contact
tan12V112 Tue Dec 2 23:20:05 CST 2008.