Spilled Milk Update #11
Sent: Mon 3/6/00 11:44 PM
Quick Reference:
1.
Whats
New?
2. This
update on the web (nicely formatted).
Hey Everybody,
First
of all Id like to welcome a whole bunch of retailers that responded to my
e-mail. I apologized in advance for
the spam,
but I hope that it was worthwhile and they will stick with me until Im
finally published. I figure were all in this together and I hope that the
final product is enjoyable and possibly educational.
Ive
gotten some questions regarding the status of the project.
Ive been bad about updating that section on the site, because not a
whole lot has changed since the holidays. Im
really waiting on Jason to finish up his pencils.
Hes been trying to get them done while changing jobs and mostly
putting in a few hours each night. Although
I would think that would equate to at least a page a week, but he puts in a ton
of detail so it takes a lot longer. I
know Im able to spend about an hour or two each night to work on
miscellaneous stuff now that my son is going to bed around 8:00 or 9:00 each
night.
Photoshop 5.5
This program rules!.
I had been stuck using Photoshop 3.0, but I was okay with it only because
I went to elementary school with one of the guys on the team, David Corboy.
Im not sure if hes still at Adobe, but wherever he is Im sure
hes doing quality work. I
couldnt believe I was okay with having to make about 20 different back-ups of
each image I worked on. This
version supplies a history window that is awesome for correcting mistakes or
just going through what if scenarios.
The text manipulation is a million times better than 3.0.
I dont have to accompany each file with an associated Word file that
contains the text that I want to put into Photoshop.
In the old version you couldnt edit the text; you had to erase and do
it over and hopefully you put it in its own layer.
Now version 5.5 does that for you. Its
about time too. I think this
will allow me to letter the rest of the issues with relative ease.
Huh! Sorry about
that
DK 64 on the brain.
Promotional Cards
Ive only
gotten two responses from retailers saying they want a few cards to put in hold
slots (you know who you are, thanks guys!).
First I need to know if you would like some, and second I need to know
how many you need. Ill get some
out to you as my supplies last.
The Ashcan
Im getting
quotes right now from North Suburban (here in Denver) and Brenner (in Texas). I know I need to do the
entire first issue for the retailers so they can determine the quality of
the book. My guess is that it will
be a gray scale version of what you see out on the site.
Ill have to put in a pinup or two to round things out two a multiple
of 4. Thats 32 pages if you do
the math and you dont count the cover.
Much of
this update will be geared towards promotion and mailing lists.
If you dont want to learn how to do mailing lists, etc. then Ill
see you in Update #12.
Promotion
Using The Master List
Again, Id like
to thank Robert Scott for pointing me to The
Master List, which is maintained now by Mark Adams.
I still dont know whether to thank Rob or to kill him. Ive definitely putting in some late nights trying to get
The Master List into a workable form.
Once I
got the website address from Rob I checked it out.
WOW! Theres a ton of data
out there! I kind of got my hopes
up, because I thought there would be a spreadsheet or database that I could use
to produce some mailing labels quickly. I
contacted the web-master and Mark replied with Ive been meaning to do
that. I was completely blown
away that he did all of this BY HAND. I
commended him on his efforts and his ability to deal with the tedium.
Mark sent me an e-mail list of about 400 retailers.
Unfortunately that was in HTML format and wasnt easily
mail-merge-able. I knew I had a few hours of coding (programming) to do before
Id have a workable database where I could start generating mailing labels.
I
figured Id start with the e-mail list and then work my way to the address
data hidden on the website. Granted,
the data isnt really hidden; its all there, but locked away in HTML pages.
I already had a program that searched an e-mail file and pulled out
e-mail addresses. You know that
spam that you get with thousands of e-mail addresses in it.
Dont you want to tell these people to get a clue.
Yeah, well I did that a few times. I
got a few thank yous and I got many how did you get my e-mail
address?. Microsoft or Disney isnt giving away money.
Someone is, however, collecting e-mail addresses and Im sure
theyve written programs like mine to pull out e-mail addresses once that
message circulates back around to somebody that cares.
Anyway, I already had code that scanned files and I modified it to look
for certain HTML tags within the file Mark sent me.
Eventually I was able to pull out the Store Name, City, State/Province,
and E-Mail address from the file and write it out to a | delimited file.
Some of the store names needed to be conditioned a little, because of the
way the & is represented in HTML (&).
I was
then able to take that file and import it into Microsoft Access 2000.
I created a couple of new fields for that table such as DateSent,
DateReplied, DateFollowedUp, IsValid, etc.
I then borrowed some code that I wrote for my comic database.
It was code that sent out e-mail based on a list of criteria.
I coded up my initial introductory e-mail and then hit the spam
button. I felt bad doing it,
because I hate getting spam, but I felt that this would be information worth
having. Also since I got the e-mail
addresses from The Master List so I felt exonerated, because thats what the
list is for.
The new
columns in the table were helpful in keeping things organized when I received
the fatal SysAdmin replies. When
I got those, I knew the address was bad and I marked them as being not valid.
Some replies were valid, but they didnt want to be on the list, but
most replies were filled with words of praise and willingness to help out.
I was pretty bowled over by the responses.
Stores with + Response:
61
Stores with Response:
4
e-mail
bounced:
39
No response:
320
Total e-mail
addresses:
426
I
know Im missing two addresses somewhere, but Im not sure where.
I let
that message simmer for about a week, before I sent out the follow up.
I knew MS Word could do an e-mail-merge so I thought Id give that a
shot. I had tried it before, but it
was with the old version of Word. As
the responses were coming in I entered their addresses or phone numbers into the
database. I knew I didnt want to use Outlook to manage my Contacts,
because attempting that from home would be extremely slow over a 28.8 modem.
I extracted the data I needed from Access and gave it to Word as my
mail-merge data source. I was able
to use Word to type up a nice follow-up, instead of trying to code up a response
and send it through Access. The
mail merge worked like a charm and it was fast too, with no failures (not like
my attempt in Access, which could only send about 25 at a time before it
completely bailed and I had to restart). Thanks
Microsoft!
I
havent figured out how to take data from an Access database and write it into
the Outlook Contact database, but that will be next on my agenda. I hope to have a full-blown Outlook database here really
shortly.
Scraping-Up
Is Hard To Do?
NAH!
Once youve got the code in place, the rest is just tedious.
The next step was to get the data from the site into an Access database
so I could use it for mailing labels. I
had written a program a while back that will drive another program using a set
of commands. You can use the old
SendKeys command in MS Visual Basic and send keystrokes to a program as if
you were typing them in yourself. I
had made the program read a file of commands and then those keystrokes would be
sent to a program such as Lview (a public domain image editor).
I could tell Lview, through this set of commands, to read in a set of
files and re-format them and then save them out again.
This is great when youve got a huge set of scans that you might have
made at 150 dpi and you want to shrink the images so theyre the same size,
but a different resolution, say 75 dpi. I
have also been able to use this program to navigate the web using the MS
Explorer.
I
thought I would give that a shot. Sure
enough, The Master List website can be navigated completely with the keyboard.
With a couple of well-placed down-arrows, backspaces, and <ALT>-F-A
sequences I could save the entire site to my hard drive.
I had to tweak the duration between navigation, but I didnt care how
long it took because I could just walk away and about 50 meg later the site was
there (maybe 50 meg is an exaggeration, but there were quite a few images too). I didnt really need the images so I trashed them.
Once
all the pages were downloaded and saved, I was able to start the parsing
process. I modified the code, which
parsed the e-mail file (above) only slightly, and I had data pretty quickly.
There were almost 88 files to parse so I modified the program again to
read in all the files in my download directory.
When all was said and done, I had | delimited file which was ready
to be brought into Access. The file
I created wasnt perfect and had some blank rows in it, but I wrote a few
queries in Access to take care of those. Now
that the data is in a workable form, Ive got 8,310 addresses at my disposal.
Unfortunately not all of them were complete and many of them are only
card shops. I wrote a few more
queries to pick out the names that have comic or comix in them.
I assumed the names with the word card in them, but not comic
were card-only shops. I needed to narrow my search down and I can sweep the rest
later.
Hard
Data
So thats
pretty much it as far as data acquisition goes.
Dont ask me for a copy of any of the data. Youll want to go through Mark @ The Master List.
Ill be passing everything I do over to him.
Once Ive cleaned up the data, I plan on normalizing the database and
Ill add some columns that Mark has asked for, so he can keep track of the
contacts for each of the shops. Im
still not sure if Ill make the source available, because Im guessing that
Mark wont want other people to do what I did.
He could end up doing what SwitchBoard did and hide their zip codes under another mouse click.
Talk about tedious! But if
you can mimic the way a person navigates the web, then the data can be acquired
all in due time. Thanks a lot Mark
for maintaining that site. I know a
little recognition goes a long way
Thanks
for all your support,
_______________________________
Parker D. Smart
President
- Spilled Milk, Corp.
psmart@SpilledMilk.com
http://SpilledMilk.com
The standard
removal policy if you should ever need it.
Just reply to this message with the word remove in the subject and you'll
be deleted from the mailing list. You
might make it in ALL CAPS or else I might not take you seriously J.