Using R to process Mail Files -
i've done bit of searching , after not finding thought post question. actually, because i've not found much, think may indicator of answer be, anyway...here is:
does have experience using r process files postal mailings...and if so...what packages use?
i realize r might not best tool task have use tools have @ hand , have "extra" things @ work stay employed...so please don't flame me hard question.
basically i'm looking @ merge purge, dup/elim sort of stuff. i've played compare() , merge() commands bit. i'd incorporate equivalencies in compares such as
st=st=st.=street
blvd=blvd=blvd.=boulevard etc...
i'm wondering if packages have been developed sort of data processing i'm not reinventing wheel.
i'd suggest following basic workflow:
(1) read in data. don't know looks based on question, i'll assume that's easy you.
(2) use mix of gsub
, toupper
, , other string manipulation tools convert data same formats. i.e., addresses use st instead of st or street, etc.
(3) merge
single dataframe.
(4) use unique
and/or sort
/order
clean list , remove duplicates.
(5) output whatever format you're going for. again, not clear question, can't offer specific advice here.
Comments
Post a Comment