Worldwide web goes truly global with Arabic | Gulf News

"Arabic Language" in the Arabic Al-B...
Image via Wikipedia

There’s good reason Arabic advertisements are appearing more and more on websites across the globe.

Online, the Arab language is flourishing as the English-language dominated internet slowly gives way to a new multilingual era, promising billions of new e-commerce dollars and a growing sense of electronic cultural self-identity in the Middle East.

“With more users in the MENA region connecting to the internet, businesses of all types and sizes are starting to realise the opportunity to express their brand values through the online marketing space,” said Joanne Kubba, Global Communications and Public Affairs Manager at Google, Middle East and North Africa.

“This is being driven by the growth in users coming online in recent years. Whilst Google does not reveal country specific data, we have seen an increased demand for Google AdWords,” Kubba said.

According to World Internet Statistics, latest recorded figures show that from 2000 to 2009, Arabic language on the web grew 2,297 per cent, easily ahead of Russia's 1,359 per cent.

Of the world’s total 1.8 billion web users, 60.2 million (3.3 per cent) are Arabic speaking.

The mushrooming of Arabic on the internet can be attributed in part to ambitious policies such as those advocated by the United Arab Emirates, which leads the Middle East region with 74.1 per cent internet penetration.

As many as 3.5 million of the 4.8 million residents in the UAE are online.

Since 2000, the UAE has witnessed a 384 per cent increase in internet user growth, during a financial and social renaissance that has propelled the Emirates onto the world stage.

via gulfnews : Worldwide web goes truly global with Arabic.

A recent improvement for Arabic searches | Official Google Blog

We've learned that when performing a search on Google, people sometimes forget to separate words with spaces. Moreover, people often mistakenly repeat a letter within a single word. For instance, when writing the query [amazingly beautiful poem], you might write it as [amazingly beautiifullpoem].

These types of errors are much more common in languages like Arabic, where most of the letters are cursive. That means that the shapes of the letters change, based on the position of the letter in the word (initial, middle, final or isolated). Moreover, some Arabic letters are considered word breaks, meaning that the following letter must be in an “initial” shape. In other words, if the last letter of one word is a word break, the following word may not be separated with a space.

For example, the queries [وزارةالتعليم] and [وزارة التعليم] have an identical meaning (Ministry of Education) and they’re both written in a common form for Arabic documents. But they have different, albeit correct, formats — the first query is written as a single word, while the second is written as two. Google needs to understand that while they’re written differently, they mean the same thing and should yield the exact same search results. In this example, both queries were written correctly, just in different formats. But sometimes people just make errors — like repeating the same letter twice. For example, you might write [راائعة الجماال], repeating the letter “ا” twice in both query words. In this case the correct spelling should be [رائعة الجمال]. It's important that Google search recognizes your query — despite spelling errors.

To address issues like this, we recently developed a search ranking improvement that targets certain Arabic queries. Our algorithm employs rules of Arabic spelling and grammar along with signals from historical search data to decide when to leave out spaces between words or when to remove unnecessarily repeated letters. Now, when you type a query leaving out spaces or repeating a letter, we’ll return better results based not only on what you typed, but also on what our algorithm understands is the “correct” query. For example, here's what happens when you type [قصيدة راائعةالجماال] ([amazingly beautiful poem] in Arabic) with repeated letters and dropped spaces between words.

via Official Google Blog: A recent improvement for Arabic searches.