O'Reilly Answers is a community site for sharing knowledge, asking questions, and providing answers that brings together our customers, authors, editors, conference speakers, and Foo (Friends of O'Reilly). More »
Do you have a problem that could be solved by iterative optimization? With this excerpt from Programming Collective Intelligence, Toby Segaran explores the use of genetic algorithms to solve such prob...
Asked by DaveEveritt : 20 February 2010 - 02:41 AM
Answered by adfm : Mar 10 2010 12:05 PM
Deployment and maintenance can be a major time suck. It seems like you already have a good idea of what's out there, but are looking for a path to the next level. I would suggest finding a decent ... full answer >
Tagging follows tokenization in the typical natural language processing pipeline. In this excerpt from Natural Language Processing with Python the authors introduce a part-of-speech tagger. The exampl...
Answered by paulbarry : Feb 19 2010 01:22 PM
My understanding is that HL7 v3.0 is XML-based, so you could start with a simple XML parser and see how you get on? Here is one for Python 3 that claims to be easy to use. It's as good a start a... full answer >
If you're creating a search engine you'll need a way to collect documents. In this excerpt from Tony Segaran's Programming Collective Intelligence the author shows you how to set up a simple web crawl...
Collaborative filtering techniques enable online retailers to recommend products, services, and media for the majority of commercial websites you visit daily. When you purchase something at Amazon....
Answered by adfm : Feb 05 2010 10:50 AM
If you are interested in comparing how others have accomplished manipulating an Excel spreadsheet using each language, then I would refer you to this article about reading and writing Excel files with... full answer >
There is a variety of software (including the Unix command-line
utility uniq and Windows PowerShell
cmdlet Get-Unique) that can help you remove duplicate lines in a file or
...
If you've used lower-level languages such as C or C++, you know that much of your work centers on implementing objects—also known as data structures—to represent the components in your applica...
If you want to check whether a certain string represents a valid IPv4 address in 255.255.255.255 notation, try one of these examples from Regular Expressions Cookbook:
Simple regex to check fo...