Our goal was to create a tool that would enable nontechnical users to quickly and easily gather data while browsing the web, allowing them to collect information directly from a web page without having to leave the browser. The user should be able to guide the extraction process, but the bulk of the work should be done by a content aggregation engine with the user only having to confirm or correct the assumptions made by the best-effort predictions of the system. For more advanced and technically-inclined users, it should however be possible for them to specify custom extraction rules.