Monday, October 15, 2007

Media Software Engineer XML LAMP PHP at ARCHIVE

The Internet Archive is a non-profit digital library committed to preserving the world's digital cultural artifacts. Used by over 6 million people, this resource is becoming part of how the Internet works. Our job is to put the best humanity has to offer within reach of students, educators and the general public. Find out more about our organization and web archive at www.archive.org. We are located in San Francisco in the beautiful Presidio. We are seeking a media software engineer to work closely with our team to expand our collections of videos, audios, software, and educational content.

Primary Responsibilities:
  • Convert third party metadata to XML metadata format.
  • Import large (multiple terabytes) datasets.
  • Extract and reorganize third party datasets from remote systems via ftp, http, rsync, and NAS.
  • Develop, deploy, and maintain software that enables Archive employees and partners to filter, organize, and manipulate their archived content.
  • Audit archive contents and provide interested parties with summaries of data volume, usage, and loss.
  • Maintain Data Collection Department's existing codebase.

Responsibilities Convert third party metadata to XML metadata format.
  • Import large (multiple terabytes) datasets.
  • Extract and reorganize third party datasets from remote systems via ftp, http, rsync, and NAS.
  • Develop, deploy, and maintain software that enables Archive employees and partners to filter, organize, and manipulate their archived content.
  • Audit archive contents and provide interested parties with summaries of data volume, usage, and loss.
  • Maintain Data Collection Department's existing codebase.
Required Skills
  • Current experience with LAMP, PHP5 and Perl.
  • Current experience with XML and UTF-8.
  • Familiarity with UNIX (Linux), including filesystems, process management, and shell operation.
  • Experience with programmatically interacting with (wrapping) web-based interfaces.
  • No fear in the face of large datasets (hundreds of terabytes).
  • Effective communication skills with geeks and non-geeks alike, both inside and outside the Archive
Preferrede Skills
  • Understanding of audio and video container formats and codecs
  • Experience with open source tools for manipulating multimedia data (ffmpeg, faac, et al)
  • Experience with XSLT
  • Experience with SSH
  • Experience with Samba
  • Experience developing software for clustered systems
Education BS or advanced degree in computer science or equivalent experience
Sorry, but no telecommuting.

We are an equal opportunity employer. Please send your resume and cover letter to jobs@archive.org with the subject line "Media Software Engineer". The Archive thanks all applicants for their interest, but advises that only those selected for an interview will be contacted. No phone calls please.

Link

No comments: