Posted by Alexandre Rafalovitch on October 16, 2006
David Rothman of the TeleRead fame has written a good background article on the e-books for the Innovate – journal of online education (free registration required).
While David’s articles at his blog are frequent and in-depth, any one of them is too tactical for a good overview. The article at the Innovate is a good summary and is rather more strategic. It also utilises the online nature of the journal to provide a comprehensive set of relevant hyperlinks. There is even a recorded Webcast connected to the article with further discussion on the issue.
(Disclosure: I have previously written an article about e-books for language learning for the TeleRead blog)
Posted in General Education, e-books | Leave a Comment »
Posted by Alexandre Rafalovitch on October 13, 2006
A new Borders bookshop opened near me. Just for fun, I decided to look at their Russian books section (in original Russian). I expected to find none. The result was much stranger.
There were two books on the shelf. One was something non-memorable. The other one, however, was Nobokov’s translation of Alice in Wonderland. So, what’s the problem? Nabokov is well known and Alice in Wonderland is always a great read in any language.
The problem is that Nabokov wrote this book using old Cyrillic alphabet that has letters not in use since 1918. I remember being told of those letters as part of school education, but have never met anyone actually able to read a book using them.
So, given that the book can only be read by researchers studying old Russian, why would Borders chose to use its shelf space for it? Was it trying to out-compete Amazon in the long-tail strategy? Not likely. My guess is that somebody in Borders’ sales office was tasked with chosing Russian titles to put on display and – not knowing any Russian – decided that one cannot go wrong with Nabokov/Alice.
The funny thing is – I was tempted to buy the book, exactly because it was such a non sequitur in the context of maximising profit in the already-failing physical bookstore business. So, maybe this was a brilliant decision after all, given the lack of shelf space to appeal in any other way.
I just wish I knew what really happened there.
Posted in Weird Stuff | Leave a Comment »
Posted by Alexandre Rafalovitch on October 9, 2006
Couple of interesting things happened recently in the Computational Linguistics related fields that I thought were worth linking to:
- ACM Queue had an interview with Mike Cohen of Google (previously of Nuance Communications) discussing recent advances and changes in speech recognition technology.
- Pluggd, with its hotly discussed demo of HearHere, uses speech recognition and some sort of topic clustering to show a time heatmap of your search keyword inside the podcast. The idea is that the heatmap allows you to skip straight to the discussion of the topic you are interested in and ignore parts unrelevant to your interest (and adverts). They have a short presentation about the product in DEMOfall archives. Warning: sometimes it takes a couple of tries to get DEMO video to play (depending on system load).
- PodZinger that already used speech recognition to search within podcasts for search terms, just added an advertising platform that is based on classifying by the content and the search term.
- Netflix has created a challenge where they provide recommendation data, so that other people can try developing an algorithm better than Netflix’s own data mining team. With the big prise of a million dollars (1,000,000$), there is a lot of competitors already. While the dataset provided only has movie titles and therefore not enough to do any text/description analysis, it is still a huge dataset to try various graph and neural network methods on. Most of the people suggest mashing it up with IMDB or some other movie information database, but that obviously requires additional data matching work.
- ClearForest on the other hand is only offering 2000 dollars (2000$) in their competition and you have to bring your own data, but at least they provide an API that does named entities recognition. Beats having to load up GATE every time and, who knows, maybe somebody can create another Gutenkarte-style mashup.
- And to finish on a funny note, maybe you would like the one generated by the STANDUP (popularised writeup): What do you get when you cross a car with a sandwich? A traffic jam.
Posted in Computational Linguistics | 3 Comments »
Posted by Alexandre Rafalovitch on October 9, 2006
Philips recently had a Simplicity event, where they showcased a number of concept products that may or may not make it into the real world in the future.
To go along with the event, Philips also setup a voting board for a number of discussion topics. One of the topics currently under discussion is whether e-books are a good idea. You can pick a side and argue out your position or vote on the arguments of others. At the end of the discussion (3 weeks from now), the results are summarised, based on the vote counts.
I have added my opinion to the forum and pointed to the TeleRead hosted copy of my article on the issue and I invite you to join in the conversation either at Philips forums or in the article’s comments area for your view on the situation.
I believe that the more interesting functionalities we can point out now, the more likely they will be incorporated into the future e-book design. Waiting until e-books are avialable, will lead to those design having just some of the advantages of a paper book, but all the disadvantages of an electronic device.
In fact, Sony’s e-book reader seems to have proven that point already. It does not even seem to have dictionary lookup, something most of the handheld e-book readers provide.
Posted in General Education, Language acquisition, Publishing, e-books | Leave a Comment »
Posted by Alexandre Rafalovitch on October 8, 2006
[This article also appears in a slightly edited form as a TeleRead entry]
Ever tried learning a foreign language? Noticed how the books you could read were often boring, and the books you wanted to read were just that bit too hard to understand? Wished, you could have a quick translation of a complex passage or precise meaning of the word from the spread of twenty that dictionary entry offers?
With paper books, you are pretty much stuck. On the other hand, e-books – with the right combination of software and open formats – may soon prove to be just the solution to keep you reading and learning in the new language. And, with the language learning market attracting billions of dollars, you can be sure somebody will find a way to make the best of the possibilities offered by e-books.
So, what are the advantages e-books can bring to language learning? Let’s start from what is achievable today and progress to the possibilities further down the line.
- Parallel texts – Intermediate and advanced readers appreciate being able to read original text, while still having a good translation available a glance away. Paper books like this do exist, but just a few, due to a high cost of production and distributed target market.For e-books, the ever decreasing price of the storage makes the size of the download irrelevant – slashing the cost of physical production. And with electronic distribution, the market reach is as wide as the internet itself.If you still have doubts, this model is already being exploited extremely well in another multi-lingual market – bible study. There are many bible translations and scholars like to be able to read them side by side to understand the deep meaning better. A number of free eReaders exist to make this task easier, including a portable one for the PocketPC.
- Dictionary bundling – Continuing with the theme of practically unlimited storage, we can easily imagine a book being bundled with a look-up dictionary that is capable of prividing a translation of every word and expression in the text.This is only possible with specially adapted texts at the moment and, even then, only some words and basic phrases are provided.With e-books, it would be possible to embed invisible hints that will show the specific meaning of the phrase in the exact context of the paragraph.Again, something similar to this has been done for Bible study with Strong’s numbers, but, with good dictionaries, the concept can be extended to any text. Many of the current e-book readers allow dictionary lookups, so the basic functionality is already available.
- Grammar learning through real examples – Most of the texts provided for learning grammar are boring and feel artificial. How about being able to choose your own text and have the reader software automatically highlight the structures you are learning this week, whether it is colour names, present perfect constructions or conjugations of the irregular verb ‘to be’? With the material being presented completely in context, the rules will be easier to understand and recall. And even if you are rereading the last week’s passage, you are learning something new, as the highlighted parts will change.
- Automatic text leveling – If we can bundle additional text that does not show up in the book normally, why can’t we have the same text several times with different levels of reading difficulties. That way, a book may contain adapted/simplified text as well as an original one.Then, any number of combinations might be used, depending on whether the reader is connected to other systems or not.For example, if the book is delivered as serialised chapters from a website, there might be grading tests in between chapters with the simplification level of the next chapter adjusted automatically based on the test results. Or it could be a Choose Your Own Adventure type of book, where the language level of the ‘next’ page depends on the language introduced in pages seen before.
This is not something that can be done economically at all for printed books, since they are frozen at the time of publishing, but with e-books the material can be reused in smaller chunks across multiple learners and therefore will allow for a viable publishing model. Especially, if we can throw the increasingly ubiquitous broadband wireless connection into the mix with pages arriving in near-real-time directly to the reader device.
- Text generation – If a ‘Choose Your Own Adventure’ format is suitable for language learning, why not a computer generated book all together. It is nearly possible with current technologies to create a book as a concept graph and have the computer generate the actual text. With a bit more design, the generated text will purposefully incorporate new words and grammatical structures to match the learner’s progression in the learning plan. It would even be possible to dynamically generate grammatical explanations, since the text-creation system has to figure it all out anyway to generate the sentences. Again, combined with feedback from external or on-device tests and near-real-time downloads, the text can always be just ahead of the learner’s own language knowledge.
- Finally, with eBooks’ texts being available in open electronic formats, it would possible to use them to generate additional contexts (such as geographic mapping) or tests based on the exact texts the learner was reading for his or her own pleasure.
There are many more possible uses of texts available in e-book rather than print form. I have just scratched the surface of what is possible.
Nor have I discussed technical details that would make each of these items really tick. Perhaps I will in a future post, if there is enough interest (vote in the comments).
For those curious now, I will just mention two concepts. They are Computational Linguistics and CALL/ATALL. Follow the directions these fields of study point out and your thinking adventures will be more wondrous than those of Alice.
Posted in Computational Linguistics, Language acquisition, Publishing, e-books | 1 Comment »