Skip to search.

Breaking News Visit Yahoo! News for the latest.

×Close this window

govtrack · GovTrack.us Discussion List

The Yahoo! Groups Product Blog

Check it out!

Group Information

  • Members: 462
  • Category: United States
  • Founded: Nov 3, 2004
  • Language: English
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Hear how Yahoo! Groups has changed the lives of others. Take me there.

Messages

Advanced
Messages Help
Messages 330 - 359 of 1188   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Show Message Summaries Sort by Date ^  
#330 From: "Marjorie Roswell" <mroswell@...>
Date: Fri Jun 15, 2007 4:15 pm
Subject: The Plum Book to org chart of presidential appointees?
margieroswell
Send Email Send Email
 
Anyone know of a good source of data on presidential appointees, especially showing who is who's boss?

We can see the plum book here:
http://www.gpoaccess.gov/plumbook/2004/index.html


But:
1. The text file link doesn't work on that page
2. Even when you view the PDFs, you can't see who's boss, or (who's who's boss).

Best Regards,

Margie

#331 From: Josh Tauberer <tauberer@...>
Date: Fri Jun 15, 2007 5:45 pm
Subject: Re: Looking for lists of elected officials
tauberer
Send Email Send Email
 
Mark Dilger wrote:
> Could anyone point me to a list of elected officials
> over time.  I expect that no single list has
> everything I want, but the ideal list would have all
> senators, representatives, presidents, etc. etc. with
> their names, positions, and such going back in history
> as far as possible.  Computer readable lists (xml, tab
> delimited, csv, postgres or mysql dumps) are
> preferable, but I can write the scripts to parse HTML,
> PDF or whatever if I need to.

GovTrack's people.xml file (see the about -> source data page) has all
congresspeople ever elected, based on bioguide.congress.gov.

> I am looking for the same data from all countries.

That's very ambitious!

--
- Josh Tauberer

http://razor.occams.info

"Yields falsehood when preceded by its quotation!  Yields
falsehood when preceded by its quotation!" Achilles to
Tortoise (in "Gödel, Escher, Bach" by Douglas Hofstadter)

#332 From: "Marjorie Roswell" <mroswell@...>
Date: Fri Jun 15, 2007 9:28 pm
Subject: Re: Looking for lists of elected officials
margieroswell
Send Email Send Email
 
Likewise, my inquiry earlier today was looking for presidential appointments. And not just the list (since it's in the "Plum Book" but who actually reports to whom.

On 6/15/07, Mark Dilger <markdilger@...> wrote:

Hi All,

Could anyone point me to a list of elected officials
over time. I expect that no single list has
everything I want, but the ideal list would have all
senators, representatives, presidents, etc. etc. with
their names, positions, and such going back in history
as far as possible. Computer readable lists (xml, tab
delimited, csv, postgres or mysql dumps) are
preferable, but I can write the scripts to parse HTML,
PDF or whatever if I need to.

I am looking for the same data from all countries.

Thanks,

mark

__________________________________________________________
Never miss an email again!
Yahoo! Toolbar alerts you the instant new Mail arrives.
http://tools.search.yahoo.com/toolbar/features/mail/



#333 From: Nancy Berry <oaktree1860@...>
Date: Fri Jun 15, 2007 10:38 pm
Subject: Re: Looking for lists of elected officials
oaktree1860
Send Email Send Email
 
majorie, the chain of commands change so frequently I am not sure there
is a way to know who reports to whom...I know sometimes a person is left
wondering who his/her direct report is from night to morning.
 
Nancy Berry


----- Original Message ----
From: Marjorie Roswell <mroswell@...>
To: govtrack@yahoogroups.com
Sent: Friday, June 15, 2007 4:28:02 PM
Subject: Re: [govtrack] Looking for lists of elected officials

Likewise, my inquiry earlier today was looking for presidential appointments. And not just the list (since it's in the "Plum Book" but who actually reports to whom.

On 6/15/07, Mark Dilger <markdilger@yahoo. com> wrote:

Hi All,

Could anyone point me to a list of elected officials
over time. I expect that no single list has
everything I want, but the ideal list would have all
senators, representatives, presidents, etc. etc. with
their names, positions, and such going back in history
as far as possible. Computer readable lists (xml, tab
delimited, csv, postgres or mysql dumps) are
preferable, but I can write the scripts to parse HTML,
PDF or whatever if I need to.

I am looking for the same data from all countries.

Thanks,

mark

____________ _________ _________ _________ _________ _________ _
Never miss an email again!
Yahoo! Toolbar alerts you the instant new Mail arrives.
http://tools. search.yahoo. com/toolbar/ features/ mail/




#334 From: "Joe Germuska" <joe@...>
Date: Mon Jun 18, 2007 10:19 pm
Subject: GovTrack Machine Tags
germuska
Send Email Send Email
 
Has there been any discussion of a machine tag convention for GovTrack "nodes"?  (see http://machinetags.org/ or http://en.wikipedia.org/wiki/Machine_tag )  If not, is there any interest?

I could imagine a lot of value to the community of GovTrack users tagging del.icio.us bookmarks or whatnot with tags that could be linked back to specific bills.  For example, Senator Dick Durbin just emailed to to let me know that H.R. 2206 has become law.  I hadn't known about it, and there isn't any analysis in Congresspedia (how new is that? I hadn't noticed)...  I wasn't sure how to google my way to finding out more about the bill, but if there were a tag standard, it might help a little.

e.g.
govtrack:bill=h110-2206

I'm still not sure exactly how I feel about Machine Tags as a method, but I like what they aim to do.  They require enough effort that they may never catch on, but it may be a bit of a chicken/egg problem.  Finding good use cases for them could help build the momentum and motivate tools to make the tagging easier.

One could easily choose a different prefix for bills (uscongress:bill=h110-2206) if 'govtrack' seemed to somehow box out other political services.

One could also suggest a format for "people" tags, like govtrack:person=300038 (Dick Durbin)  Not sure if there is any usable unique ID that would make it more "shared" than govtrack.

For people like Dick Durbin, the likelihood of users selecting the same tag intuitively is much higher than for tagging a bill.  There are 14 URLs tagged "dickdurbin" in Delicious, and 31 tagged "durbin".  Maybe people don't really want to tag things about him that much, although there are a lot more for the junior Senator from IL... 

Joe
--
Joe Germuska
Joe@... * http://blog.germuska.com    

"I felt so good I told the leader how to follow."
-- Sly Stone

#335 From: Josh Tauberer <tauberer@...>
Date: Tue Jun 19, 2007 12:36 am
Subject: Re: GovTrack Machine Tags
tauberer
Send Email Send Email
 
Joe Germuska wrote:
> Has there been any discussion of a machine tag convention for GovTrack
> "nodes"?  (see http://machinetags.org/ or
> http://en.wikipedia.org/wiki/Machine_tag
> <http://en.wikipedia.org/wiki/Machine_tag>)  If not, is there any interest?

Hi, Joe.

I'm not a particularly big fan of tags. I see them as a very short term,
unorganized solution to a much larger problem. As time goes on, the
likelihood of cashes of tag names grows and grows (i.e. ambiguity of
what the tag was meant to refer to), and as people try to push more
information inside of a tag, we get stuck trying to parse the
information back out, not knowing whether it was intended or not.

My preferred approach is using semantic web methods, like URIs for
identifying things and RDF for encoding structured data.

> I could imagine a lot of value to the community of GovTrack users
> tagging del.icio.us <http://del.icio.us> bookmarks or whatnot with tags
> that could be linked back to specific bills.

And so from that perspective, I'm not opposed to adding things like that
to the site, but people would have to point me in the right direction
for what to do.

> govtrack:bill=h110-2206

In the ideal world, I would use a globally unique identifier for each bill.

> One could also suggest a format for "people" tags, like
> govtrack:person=300038 (Dick Durbin)  Not sure if there is any usable
> unique ID that would make it more "shared" than govtrack.

There are IDs assigned with the Library of Congress (see
bioguide.congress.gov) which are neutral ground, although they have some
issues like multiple identifiers assigned to the same individual ---
after women change their name (at least in their historical data, which
might just be unintentional).

> For people like Dick Durbin, the likelihood of users selecting the same
> tag intuitively is much higher than for tagging a bill.

And the likelihood that other Dick Durbin's in the world get conflated
with our congressional guy also is higher, which is why I don't like
tags much.

But like I said, I'm not deeply opposed to using them on the website.

- Josh

#336 From: "Joe Germuska" <joe@...>
Date: Tue Jun 19, 2007 2:19 am
Subject: Re: GovTrack Machine Tags
germuska
Send Email Send Email
 
On 6/18/07, Josh Tauberer <tauberer@...> wrote:

Joe Germuska wrote:
> Has there been any discussion of a machine tag convention for GovTrack
> "nodes"? (see http://machinetags.org/ or
> http://en.wikipedia.org/wiki/Machine_tag
> <http://en.wikipedia.org/wiki/Machine_tag>) If not, is there any interest?

Hi, Joe.

I'm not a particularly big fan of tags. I see them as a very short term,
unorganized solution to a much larger problem.

Josh:

Thanks for your response.

I understand your preference for unambiguous semantic annotations.  I am of mixed mind myself, as I feel that the populist potential of tags is important.  The problem is whether machine tags are simple enough that they'll be popularly adopted, or whether they will end up not gaining traction. 

Or, the problem (if one is not particularly populist) may be that users can't be trusted to tag things correctly, even if tags are not considered so ambiguous.  However, if taggers adhere to a contract, there is no reason that any given tag can't be mapped to a URI appropriate for use in an RDF triple subject or object.

So, one could easily say that, within the machine tag convention,
govtrack:person=300038
means exactly
tag:govshare.info,2005:data/us/congress/people/D000563

so that you could read a feed of Delicious bookmarks and construct RDF triples from them that say
(A New York Times article) (is about) (Dick Durbin).  Is "aboutness" interesting enough?  I have trouble believing that even experts are going to make much more meaningful relationships.  [ (A Drudge Report posting) (slanders) (a member of congress) (A letter to the editor) (totally misrepresents) (the bill to outlaw television) ?? ]  But even if they were going to, one could come up with a taggish compression pairing common predicates with well-identified objects. 

I know you don't care for the term "tagging", but I can't think of anything else to call "every day people annotating pages they come across."  Do you see any general use cases for human-mediated structuring of (gardening?) the pool of data that makes up GovTrack?  Right now, everything, or just about, is generated by transforming data files, correct?  Are there places where human intelligence would make something better? 

If there are, then there need to be tools to help -- no one writes RDF with a text editor for very long.  But there's no reason one couldn't use tag autocompletion and human-readable names to make it pretty easy for people to contribute data "from the field," if there's data that is to be desired.

In one sense, this isn't much different from using the Technorati feed of people who have linked to the bill.  But realistically, maybe just the effort devoted to writing a post about a bill vs. tag-and-run ends up being a useful way to keep the signal-to-noise ratio better.

but now I'm starting to ramble...

Joe

(PS Is GovTrack data part of Freebase.com's data pile?)


--
Joe Germuska
Joe@... * http://blog.germuska.com   
"I felt so good I told the leader how to follow."
-- Sly Stone



#337 From: "Bryan L. Fordham" <bfordham@...>
Date: Tue Jun 19, 2007 3:13 pm
Subject: Re: GovTrack Machine Tags
blfordham
Send Email Send Email
 
Personally, I love the idea of tagging, though not necessarily as
something that would be on the front page. It's on my todo list at
critter watch, though admittedly it's pretty far down the list.

But yeah, there are a number of problems with it. And some good things.
I think letting folks tag items, and other users can optionally view
them (or perhaps view the most popular tags for some items) would be a
fun experiment.

--B

#338 From: "Joe Germuska" <joe@...>
Date: Tue Jun 19, 2007 7:22 pm
Subject: Re: GovTrack Machine Tags
germuska
Send Email Send Email
 
Just to be clear

On 6/19/07, Bryan L. Fordham <bfordham@...> wrote:

Personally, I love the idea of tagging, though not necessarily as
something that would be on the front page. It's on my todo list at
critter watch, though admittedly it's pretty far down the list.

But yeah, there are a number of problems with it. And some good things.
I think letting folks tag items, and other users can optionally view
them (or perhaps view the most popular tags for some items) would be a
fun experiment.



Just for clarity's sake, let me point out that I'm not talking at all about GovTrack allowing users to apply tags to GovTrack data.  I don't have a very strong opinion about whether or not GovTrack should support that. 

Take, for example S. 1348: A bill to provide for comprehensive immigration reform and for other purposes GovTrack already shows us ten blogs that have linked to the tracking page for that bill, using Technorati. 

But not all sources will know to (or choose to) link to the GovTrack page.  What if I'm passionate about this issue and I find this article on MSNBC about the bill.  If I knew how to tag it reliably in Delicious, then Josh could set up another feed alongside the Technorati one which would pick up on my recognition that MSNBC was talking about S. 1348, even though MSNBC didn't link back to GovTrack, or even use the bill number in the article.  Of course, with the right convention, this is not limited to GovTrack by any means.  Political blogs of any persuasion could do the same thing.

Now, maybe one of Josh's hesitations is the question about how many taggers in the field would make the correct links, and that's a valid question.  For blogs with more of an editorial position, I think the simple answer is to limit the number of users whose tags are even considered.  And maybe at the end of the day, GovTrack itself wouldn't want to make those choices and wouldn't want to accept a totally unfiltered tag feed (although the Delicious spammers would probably use a lot of other tags first before they start abusing wonky machine-oriented tags).  But since this is a list of people who are generally interested in applying technology to civics, I thought I'd float it...

I hope the above example clarifies my point.  Should anyone be at all interested, I wrote a few things about this general topic that have drifted off the front page of my blog:
http://blog.germuska.com/?p=496
http://blog.germuska.com/?p=497

Joe

--
Joe Germuska
Joe@... * http://blog.germuska.com    

"I felt so good I told the leader how to follow."
-- Sly Stone

#339 From: "tay199" <tay199@...>
Date: Fri Jun 22, 2007 8:15 am
Subject: Need Data... Congressional Districts | Cities | Zips
tay199
Send Email Send Email
 
Hello,

Does anyone know where I can get data that has the congressional
districts, and corresponding cities and zips within each district?

Thanks much,

Taylor

#340 From: "Jeremy Dunck" <jdunck@...>
Date: Fri Jun 22, 2007 1:00 pm
Subject: Re: Need Data... Congressional Districts | Cities | Zips
ralinon
Send Email Send Email
 
districts: http://www.census.gov/geo/www/cob/cd_metadata.html
cities: http://www.census.gov/geo/www/tiger/
zip codes: http://postgis.refractions.net/pipermail/postgis-users/2006-July/012671.html

On 6/22/07, tay199 <tay199@...> wrote:

Hello,

Does anyone know where I can get data that has the congressional
districts, and corresponding cities and zips within each district?

Thanks much,

Taylor



#341 From: Chris Joyner <jchristopherjoyner@...>
Date: Fri Jun 22, 2007 3:53 pm
Subject: Re: Need Data... Congressional Districts | Cities | Zips
jchristopher...
Send Email Send Email
 
http://www.DownSizeDC.org may be somewhat helpful.

tay199 <tay199@...> wrote:
Hello,

Does anyone know where I can get data that has the congressional
districts, and corresponding cities and zips within each district?

Thanks much,

Taylor




NOTICE: Due to Presidential Executive Orders, the National Security
Agency may have read this email without warning, warrant, or notice. They
may do this without any judicial or legislative oversight. You have no
recourse nor protection save to call for the impeachment of the current
President.


You snooze, you lose. Get messages ASAP with AutoCheck
in the all-new Yahoo! Mail Beta.

#342 From: "tay199" <tay199@...>
Date: Sat Jun 23, 2007 4:21 am
Subject: Re: Need Data... Congressional Districts | Cities | Zips
tay199
Send Email Send Email
 
Thanks much for the links to the specific data.

The zip code link leads to another link below that is not working.

http://www.census.gov/geo/www/bob/bdy_files.html

I think it's referencing this page...

http://www.census.gov/geo/www/tiger/zip1999.html

Looks like the most recent zip data is 1999.

The post office charges $900 to get the most recent! :o

Any other ideas?

Taylor



The link for the zip code file

--- In govtrack@yahoogroups.com, "Jeremy Dunck" <jdunck@...> wrote:
>
> districts: http://www.census.gov/geo/www/cob/cd_metadata.html
> cities: http://www.census.gov/geo/www/tiger/
> zip codes:
>
http://postgis.refractions.net/pipermail/postgis-users/2006-July/012671.html
>
> On 6/22/07, tay199 <tay199@...> wrote:
> >
> >   Hello,
> >
> > Does anyone know where I can get data that has the congressional
> > districts, and corresponding cities and zips within each district?
> >
> > Thanks much,
> >
> > Taylor
> >
> >
> >
>

#343 From: Kennieth Goodwin <kenny@...>
Date: Mon Jun 25, 2007 9:31 pm
Subject: Re: Need Data... Congressional Districts | Cities | Zips
kenniethgoodwin
Send Email Send Email
 
Try the 2006 Tiger Census Data, I have it downloaded.  Check here:
http://www.census.gov/geo/www/tiger.  I would send them to ya, but it's
well over 650GB.

#344 From: Josh Tauberer <tauberer@...>
Date: Sat Jun 30, 2007 7:20 pm
Subject: [Fwd: New Registry of State agency databases]
tauberer
Send Email Send Email
 
Thought some might be interested.

- Josh

-------- Original Message --------
Subject:  [openhouseproject] New Registry of State agency databases
Date:  Sat, 30 Jun 2007 11:20:55 -0900
From:  Daniel Cornwall <daniel.cornwall@...>
Reply-To:  openhouseproject@googlegroups.com
To:  <openhouseproject@googlegroups.com>



Hello,

I realize this group is focused on federal level information, but I
thought some of you might interested in a new American Library
Association Government Documents Roundtable project that will attempt to
provide annotated links to public databases produced by the fifty state
governments plus the District of Columbia.

The main project page can be found at
http://wikis.ala.org/godort/index.php/State_Agency_Databases. At this
moment in time, only the Alaska page is close to being finished, but
viewing it will give you a flavor for the proposed project. The Alaska
page is at http://wikis.ala.org/godort/index.php/Alaska.

If you'd like to help, please adopt a state and start adding databases
using the Alaska page as a template. If you don't like the Alaska page,
click on the "talk" tab of
http://wikis.ala.org/godort/index.php/State_Agency_Databases and suggest
a better format.

Otherwise, just watch the space because it will become progressively
more useful. - Daniel Cornwall, Free Government Information,
dnlcornwall@...

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google
Groups "Open House Project" group.
To post to this group, send email to openhouseproject@googlegroups.com
To unsubscribe from this group, send email to
openhouseproject-unsubscribe@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/openhouseproject?hl=en
-~----------~----~----~----~------~----~------~--~---


--
- Josh Tauberer

http://razor.occams.info

"Yields falsehood when preceded by its quotation!  Yields
falsehood when preceded by its quotation!" Achilles to
Tortoise (in "Gödel, Escher, Bach" by Douglas Hofstadter)

#345 From: "John DeBruyn" <jdebruyn@...>
Date: Sun Jul 1, 2007 5:41 am
Subject: RE: [Fwd: New Registry of State agency databases]
john_debruyn
Send Email Send Email
 

Hi Josh:

 

Try this version of the Alaska link without the period at the end, if the one in the message below does not work for you:

 

http://wikis.ala.org/godort/index.php/Alaska

 

That works for me while the one in the second paragraph below didn’t.

 

Also, if the period at the end of the URL for the State_Agencies_Database page doesn’t work, try the one in the next paragraph without the trailing period.

 

Thanks for passing this along,

 

John

 

John DeBruyn Denver CO USA

 


From: govtrack@yahoogroups.com [mailto:govtrack@yahoogroups.com] On Behalf Of Josh Tauberer
Sent: Saturday, June 30, 2007 1:20 PM
To: govtrack@yahoogroups.com
Subject: [govtrack] [Fwd: New Registry of State agency databases]

 

Thought some might be interested.

- Josh

-------- Original Message --------
Subject: [openhouseproject] New Registry of State agency databases
Date: Sat, 30 Jun 2007 11:20:55 -0900
From: Daniel Cornwall <daniel.cornwall@alaska.net>
Reply-To: openhouseproject@googlegroups.com
To: <openhouseproject@googlegroups.com>

Hello,

I realize this group is focused on federal level information, but I
thought some of you might interested in a new American Library
Association Government Documents Roundtable project that will attempt to
provide annotated links to public databases produced by the fifty state
governments plus the District of Columbia.

The main project page can be found at
http://wikis.ala.org/godort/index.php/State_Agency_Databases. At this
moment in time, only the Alaska page is close to being finished, but
viewing it will give you a flavor for the proposed project. The Alaska
page is at http://wikis.ala.org/godort/index.php/Alaska.

If you'd like to help, please adopt a state and start adding databases
using the Alaska page as a template. If you don't like the Alaska page,
click on the "talk" tab of
http://wikis.ala.org/godort/index.php/State_Agency_Databases and suggest
a better format.

Otherwise, just watch the space because it will become progressively
more useful. - Daniel Cornwall, Free Government Information,
dnlcornwall@alaska.net

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google
Groups "Open House Project" group.
To post to this group, send email to openhouseproject@googlegroups.com
To unsubscribe from this group, send email to
openhouseproject-unsubscribe@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/openhouseproject?hl=en
-~----------~----~----~----~------~----~------~--~---

--
- Josh Tauberer

http://razor.occams.info

"Yields falsehood when preceded by its quotation! Yields
falsehood when preceded by its quotation!" Achilles to
Tortoise (in "Gödel, Escher, Bach" by Douglas Hofstadter)


#346 From: Josh Tauberer <tauberer@...>
Date: Wed Jul 4, 2007 1:24 am
Subject: Initial source code release
tauberer
Send Email Send Email
 
Group,

I've publicly posted some of the screen-scraping Perl scripts GovTrack
uses to collect its data. I'll be adding my scripts to the Subversion
repository below over time with the purpose of making it possible for
others to maintain, extend, and improve them.

The scripts are not released under an open source license for a variety
of reasons, among which is the fact that open source licenses don't
enforce the type of open information model I'd want the *fruits* of the
scripts to be covered under. Since wide distribution of the scripts
isn't necessary to benefit from them, an open source license would be
teethless.

The repository is browsable here:
http://razor.occams.info/code/repo/?/govtrack/gather/us

I don't know quite what it would take to make any of the scripts
actually work for you, though. Certainly you'll need a 'data' directory
in the right place, and possibly MySQL tables in the right place, Perl
modules, etc. If you try to get them working and can't, or make
progress, post any notes here, or better yet post patches to make set-up
easier. Also, the code is heavily *un*commented, and well... that's just
the way it will be.

All patches are welcome.

--
- Josh Tauberer

http://razor.occams.info

"Yields falsehood when preceded by its quotation!  Yields
falsehood when preceded by its quotation!" Achilles to
Tortoise (in "Gödel, Escher, Bach" by Douglas Hofstadter)

#347 From: "damianmont" <photoca@...>
Date: Thu Jul 5, 2007 5:59 pm
Subject: Re: Initial source code release
damianmont
Send Email Send Email
 
--- In govtrack@yahoogroups.com, Josh Tauberer <tauberer@...> wrote:
> The repository is browsable here:
> http://razor.occams.info/code/repo/?/govtrack/gather/us

Thank you Josh as always.

Any way you could just document what each file does?

Here's list:
database.people.sql
database.tables2.sql
database.tables.sql
db.pl
general.pl
indexing.pl
parse_record.pl
parse_rollcall.pl
parse_status.pl
personaldb.pl
sql.pl
util.pl

(p.s. EXCELLENT article on XML.com... That's how I found your site)

Also could you maybe send us more or less the pages you scrape?
I'll probably get them from reading the script, but I'm a php guy, not
perl...but a programmer is a programmer right? I should be able to
figure it out.

#348 From: "Joshua Tauberer" <tauberer@...>
Date: Thu Jul 5, 2007 6:27 pm
Subject: Re: Initial source code release
tauberer
Send Email Send Email
 
--- In govtrack@yahoogroups.com, "damianmont" <photoca@...> wrote:
> Any way you could just document what each file does?
>
> Here's list:

Sure.

> database.people.sql
MySQL table schema and data for "people" table of all people that have
served in Congress (name, birthday, etc.), and "people_roles" table
for every role in Congress each person has served (role type
(senator/representative), start/end date, party, etc.).

> database.tables2.sql
> database.tables.sql
MySQL table schemas for other tables that are filled in by the
scripts, mostly for indexing bills. The people tables are the only
ones I edit by hand and are not automatically generated from some
other source.

You'll need to pipe these to mysql, or otherwise load them, for any of
the scripts to work. (The indexing tables aren't strictly necessary if
you disable the indexing routines one way or another, but the people
tables are pretty critical for all of the parsing scripts.)

> db.pl
Utility script for opening the MySQL database.

> general.pl
Really old utility functions that I don't really use.

> indexing.pl
Subroutines that update the indexing MySQL tables based on the
contents of a bill or vote file.

> parse_record.pl
Parses the Congressional Record from THOMAS.

> parse_rollcall.pl
Parses roll call pages from the House and Senate websites.

> parse_status.pl
Parses bill status pages from THOMAS.

> personaldb.pl
Converts a name of a representative into an ID. Considers a date, role
type, and state/district info to disambiguate names when it's ambiguous.

> sql.pl
Utility functions for dealing with MySQL (preparing SQL statements
programmatically).

> util.pl
A ton of utility functions used throughout.

> (p.s. EXCELLENT article on XML.com... That's how I found your site)

Thanks!

> Also could you maybe send us more or less the pages you scrape?
> I'll probably get them from reading the script, but I'm a php guy, not
> perl...but a programmer is a programmer right? I should be able to
> figure it out.

That's a long list. Maybe another time!

- Josh

#349 From: "Jeremy Dunck" <jdunck@...>
Date: Fri Jul 6, 2007 2:16 pm
Subject: Templatemaker: magic screen scraping
ralinon
Send Email Send Email
 
I suspect anyone on this list using Python will be interested in this:
http://holovaty.com/blog/archive/2007/07/06/0128

#350 From: "damianmont" <photoca@...>
Date: Fri Jul 6, 2007 2:43 pm
Subject: Which is the better Bill Screen scraping site
damianmont
Send Email Send Email
 
Can I have your opinion Joshua (and anyone else that wants to chime in)

Your excellent code points to you checking the most recent bill action at:
http://thomas.loc.gov/cgi-bin/query/B?r110:@FIELD(FLD003+h)+@FIELD(DDATE+2007062\
9)
(say for the "house" and for the date: 06/29/2007)

What about looking in the following directory:
http://thomas.loc.gov/home/gpoxmlc110/

They seem to have an update list all in xml format.

Your opinion on it's quality. Had you seen it before? do you think
it'll be there for a while?

#351 From: "Joshua Tauberer" <tauberer@...>
Date: Fri Jul 6, 2007 3:15 pm
Subject: Re: Which is the better Bill Screen scraping site
tauberer
Send Email Send Email
 
--- In govtrack@yahoogroups.com, "damianmont" <photoca@...> wrote:
>
> Can I have your opinion Joshua (and anyone else that wants to chime in)
>
> Your excellent code points to you checking the most recent bill
action at:
>
http://thomas.loc.gov/cgi-bin/query/B?r110:@FIELD(FLD003+h)+@FIELD(DDATE+2007062\
9)
> (say for the "house" and for the date: 06/29/2007)
>
> What about looking in the following directory:
> http://thomas.loc.gov/home/gpoxmlc110/
>
> They seem to have an update list all in xml format.
>
> Your opinion on it's quality. Had you seen it before? do you think
> it'll be there for a while?
>

I happen to write something about that a few days ago:
http://www.theopenhouseproject.com/2007/07/04/legislative-xml-what-we-have-and-w\
hat-were-seeking/

The first part of that post describes the (gpoxml) files you found.

- Josh

#352 From: "tay199" <tay199@...>
Date: Sat Jul 7, 2007 12:16 am
Subject: Re: Initial source code release
tay199
Send Email Send Email
 
YEAH! Thanks Josh. I'll keep the group posted on any patches or things
of value we find and can contribute.

Taylor

#353 From: "Kevin Henry" <k@...>
Date: Sun Jul 8, 2007 11:45 am
Subject: Re: Initial source code release
severian43
Send Email Send Email
 
Great, thanks Josh...

I didn't do much scripting (proper) for http://www.whereabill.org/,
but since Josh has started the ball rolling, I'll add in the one
script I did write: an XSLT file that takes the XML version of Josh's
people database
(http://www.govtrack.us/data/us/110/repstats/people.xml) and extracts
the people/roles relevant to a single specified Congressional session.

The problem (for my purposes, which include sending this information
to the browser client on each request) is that the people.xml file is
large (6.3MB), and includes lots of dead people. :) So I use this
script to get only the information for a particular Congress.

Kevin


bioseparate.xml:

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version='1.0'
xmlns:xsl='http://www.w3.org/1999/XSL/Transform'
xmlns:exsl="http://exslt.org/common">
<xsl:output method="xml" version="1.0" encoding="ISO-8859-1"
indent="yes"/>

<xsl:param name="congress">110</xsl:param>

<xsl:template match="people">
	 <xsl:copy>
	 <xsl:variable name="striprole">
		 <xsl:for-each select="person">
			 <xsl:copy>
			 <xsl:copy-of select="@*"/>
			 <xsl:copy-of select="role[(((number($congress)*2 + 1787) >=
number(substring-before(@startdate,'-'))) and ((number($congress)*2 +
1787) <= number(substring-before(@enddate,'-')))) or
(((number($congress)*2 + 1788) >=
number(substring-before(@startdate,'-'))) and ((number($congress)*2 +
1788) <= number(substring-before(@enddate,'-'))))]"/>
			 </xsl:copy>
		 </xsl:for-each>
	 </xsl:variable>
	 <xsl:for-each select="exsl:node-set($striprole)/person[role]">
		 <xsl:copy-of select="."/>
	 </xsl:for-each>
	 </xsl:copy>
</xsl:template>

</xsl:stylesheet>

#354 From: "damianmont" <photoca@...>
Date: Tue Jul 10, 2007 7:26 pm
Subject: Re: Initial source code release
damianmont
Send Email Send Email
 
Kevin,

Love that www.WhereaBill.org site.
You use the information from josh's xml files?

Love the mashup, very well done.

--- In govtrack@yahoogroups.com, "Kevin Henry" <k@...> wrote:
>
> Great, thanks Josh...
>
> I didn't do much scripting (proper) for http://www.whereabill.org ,
> but since Josh has started the ball rolling, I'll add in the one
> script I did write: an XSLT file that takes the XML version of Josh's
> people database
> (http://www.govtrack.us/data/us/110/repstats/people.xml) and extracts
> the people/roles relevant to a single specified Congressional session.

#355 From: Peggy Garvin <pgrvn@...>
Date: Tue Jul 10, 2007 9:31 pm
Subject: Re: Re: Initial source code release--whereabill
pgrvn
Send Email Send Email
 
I'd like to know, too. I am writing a brief article (right now, due tomorrow) about some of the new legislative info projects and want to mention whereabill as well as a sample of sites that have used Govtrack's file.

Thanks,
Peggy Garvin
peggy -at- garvinconsulting.com


damianmont <photoca@...> wrote:
Kevin,

Love that www.WhereaBill.org site.
You use the information from josh's xml files?

Love the mashup, very well done.

--- In govtrack@yahoogroups.com, "Kevin Henry" <k@...> wrote:
>
> Great, thanks Josh...
>
> I didn't do much scripting (proper) for http://www.whereabill.org ,
> but since Josh has started the ball rolling, I'll add in the one
> script I did write: an XSLT file that takes the XML version of Josh's
> people database
> (http://www.govtrack.us/data/us/110/repstats/people.xml) and extracts
> the people/roles relevant to a single specified Congressional session.



#356 From: "Kevin Henry" <k@...>
Date: Wed Jul 11, 2007 11:30 am
Subject: Re: Initial source code release--whereabill
severian43
Send Email Send Email
 
Peggy and Damian,

Thanks, glad you like the site.

I'm getting all the data from govtrack. Specifically, I'm using the
following files:

- the bill status data (www.govtrack.us/data/us/*/bills/*.xml)
- the roll vote data (/data/us/*/rolls/*.xml)
- the people database (/data/us/110/repstats/people.xml)
- the popularity listing (/data/us/bills.technorati.xml)
- the search service (/congress/billsearch_api.xpd)

I keep a copy of all the files on my server, and do a daily rsync (as
Josh describes here: http://www.govtrack.us/source.xpd) to stay current.

Basically, when the server gets a request for a certain bill, it
retrieves the status data and goes through the action items, parsing
them into the steps that will be represented in the "driving
directions". It also retrieves any relevant roll vote data, and then
sends that information (along with the summary, the titles, the list
of sponsors, and the biographical data for that session of Congress)
back to the client, which renders everything (with the help of the
Google Maps API).

So it's really a (sort-of) UI sitting on top of govtrack's (sort-of) API.

Let me know if you need any more information...


Regards,
Kevin
http://www.whereabill.org/

#357 From: Nathan Hawks <nhawks@...>
Date: Tue Jul 24, 2007 5:25 pm
Subject: Howdy doo from the newbie
perly_white
Send Email Send Email
 
Hi all,

Just wanted to drop a quick line of introduction.  I was turned on to
the list at Josh's recommendation. I am a 30-year-old PHP programmer and
writer living in Akron, Ohio.

Once upon a time I tried to start a site that treated the various
Indymedias as news wires, but after a few months (and this was before
RSS was as well-developed as it is today) I burned out and quit.

I have a few ideas on an issue-tracking site similar to GovTrack and
soonishly, I'll formulate them into a proposal for open source
development.

Good to meet you all!

Nathan

#358 From: "tay199" <tay199@...>
Date: Thu Jul 26, 2007 8:53 pm
Subject: I've Got An Ugly Map... Can You Help?
tay199
Send Email Send Email
 
Hello,

Alaska is taking over my map!!!
http://72.32.197.213/govit/Pages/search/mapchartforgovleader.aspx

I'd really like to decrease the size of Alaska, and move Alaska and
Hawaii under the mainland.

My boundary files are from... http://www.census.gov/geo/www/cob/cd110.html

Can anyone help me by telling me how I can adjust the size and
locations of Alaska and Hawaii?

Thank you,

Taylor

#359 From: "tay199" <tay199@...>
Date: Thu Jul 26, 2007 8:55 pm
Subject: Re: Howdy doo from the newbie
tay199
Send Email Send Email
 
Howdy,

Nice to meet you too. My name is Taylor and I'm a web project manager
from San Francisco. That means I was not a good enough coder to be a
programmer, or an artist to be a designer. ;)

Looking forward to hearing about your project.

Cheers,

Taylor

Messages 330 - 359 of 1188   Oldest  |  < Older  |  Newer >  |  Newest
Add to My Yahoo!      XML What's This?

Copyright © 2010 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines NEW - Help