Senior adviser calls for stronger EU data protection laws – Computerworld

Europe needs strong and effective data protection, the European data protection supervisor said Monday.

Responding to the recent European Commission communication on data protection reform, Peter Hustinx underlined the importance of a clear legal framework that harmonizes national data protection legislation, particularly in societies where private information is widely gathered without individuals’ knowledge.

He also called for a technologically neutral approach, the inclusion of the principles of privacy by design and accountability, and the introduction of a mandatory security breach notification covering all relevant sectors.

“Data protection is not an abstract thing. It relates to everybody’s life, every moment of every day. There is no room for mistakes here: the challenges are enormous. That is why the proposed solutions must be equally ambitious and actually enhance the effectiveness of the instruments of data protection,” he said.

via Senior adviser calls for stronger EU data protection laws – Computerworld.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

Use Of Backup Tapes In Computer Forensics | Analogue and Digital

Data in local systems comes and goes and can often be replaced, especially where this is the intention of the business or person being investigated. Back up data information provides a snap-shot of a system or systems and therefore provides a historical record. Therefore if there is an attempt to remove information from a local system and that information was previously stored on a back up system then that information will be able to be recovered within the backup data tape.

Those who specialize in this form of investigation will work back through the backup data tapes and can therefore gain a greater insight into any system abuse or illegal behavior that may have taken place. Unless the person who is attempting to erase information has a great knowledge of the system and erasure techniques then the information that is being sought, if it in fact exists, should be located within the backup infrastructure.

Those conducting the investigation of the data must have knowledge of the backup infrastructure itself. There is likely to be a significant amount of information stored within backup tapes so knowledge of how to process this information to reduce the search time requirements is a key factor. This is especially important relating to cost factors as well as man-power and time to conduct any investigation or audit.

As an example, if there are 3000 tapes that require 3 hours each to read completely and you could use 10 systems with 80% operating time this would mean the required time to read the 3000 tapes would be approximately 50 days. This does not take into account the requirement to actually analyze and organize the data itself.

via Use Of Backup Tapes In Computer Forensics | Analogue and Digital.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

Facebook admits its third-party developers have mishandled private data | Web Apps News – Betanews

In what could be potentially damaging to a company already being criticized over its privacy issues, Facebook admitted late Sunday that it had knowledge of developers passing information called user IDs within applications. The user ID is a unique set of numbers that identify users on the site.

Facebook engineer Mike Vernal said in a blog post that in most cases the company believed developers were doing this unintentionally, but regardless it was a violation of the social networking site’s privacy policy. Vernal did however say the press was overblowing the situation.

“Knowledge of a UID does not enable anyone to access private user information without explicit user consent,” he claimed. “Nevertheless, we are committed to ensuring that even the inadvertent passing of UIDs is prevented and all applications are in compliance with our policy.”

The Wall Street Journal said that the issue may affect “tens of millions” of application users, even those who have their privacy settings as strict as is currently possible. Zynga’s popular apps Farmville, Frontierville, and Texas Hold ‘Em all have the issue, it found, among others.

via Facebook admits its third-party developers have mishandled private data | Web Apps News – Betanews.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

Facebook Is Secretly Building A Phone

Facebook is building a mobile phone, says a source who has knowledge of the project. Or rather, they’re building the software for the phone and working with a third party to actually build the hardware. Which is exactly what Apple and everyone else does, too.

It was a little less than a year ago that we broke the news that Google was working on a phone of its own – which was eventually revealed as the Nexus One. It was about that time, says out source, that Facebook first became concerned about the increasing power of the iPhone and Android platforms. And that awesome Facebook apps for those phones may not be enough to counter a long term competitive threat.

Specifically, Facebook wants to integrate deeply into the contacts list and other core functions of the phone. It can only do that if it controls the operating system.

Two high level Facebook employees – Joe Hewitt and Matthew Papakipos – are said to be secretly working on the project, which is unknown even to most Facebook staff.

Both have deep operating system experience.

Hewitt helped create the Firefox browser and was working on Parakey before it was acquired by Facebook in 2007. Parakey, which never launched, was described as a “Web-based operating system.” Hewitt also created all of Facebook’s iPhone web apps and then native apps, but finally quit building for the iPhone in disgust late last year. But he knows operating systems and he knows mobile.

via Facebook Is Secretly Building A Phone.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

AT&T, Verizon to Target Visa, MasterCard With Smartphones – Bloomberg

AT&T Inc. and Verizon Wireless, the biggest U.S. mobile carriers, are planning a venture to displace credit and debit cards with smartphones, posing a new threat to Visa Inc. and MasterCard Inc., three people with direct knowledge of the plan said.

The partnership, which also includes Deutsche Telekom AG unit T-Mobile USA, may work with Discover Financial Services and Barclays Plc to test a system at stores in Atlanta and three other U.S. cities that would let a consumer pay with the contactless wave of a smartphone, the people said. The carriers have been searching for a chief executive officer.

via AT&T, Verizon to Target Visa, MasterCard With Smartphones – Bloomberg.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

10 Things that Make Search a Semantic Search | hakia

With the recent article on ReadWriteWeb, it seems like the old debate is back. Are Google’s squared results coming from a real semantic backbone, or is it a good old entity extraction trick anyone, who is capable of copying and pasting lists, could do?

We illustrate 10 points below that define semantic search using our online demo where we compared hakia’s enterprise search system with Pubmed’s search engine, side by side, QDEXing 20 million documents on Pubmed.

1- Handling morphological variations

A semantic search engine is expected to handle all morphological variations (like tenses, plurals, ect.) on a consistent basis. In other words, the results should not change whether you type “improve, improves, improving, improved, improvement”. The example query “improving quality of life” illustrates that hakia results contain morphological variations of the query.

2- Handling synonyms with correct senses

A semantic search engine is expected to handle synonyms (cure, heal, treat,.. ect) in the right context and with correct word senses. For example, the word “treat” can mean doing social favors as in trick and treat, which would not be correct in the medical sense. The example query “is there a cure for ALS” shows that hakia brings results with synonyms with the correct senses. The level of sense disambiguation in a semantic search engine is a sign of its progress.

3- Handling generalizations

A semantic search engine is expected to handle generalizations (disease = GERD, ALS, AIDS, etc.) where the user’s query is expressed in generalized form and the result is expected to be specific. The example query “Which disease has the symptom of coughing?” brings a result set in hakia such that GERD is recognized by the system as the specific answer.

4- Handling concept matching

Perhaps the most challenging functionality among all, a semantic search engine is expected to recognize concepts and bring relevant results (political instability = insurgency, unrest, etc.) Usualy, the depth of this capability is increased in verticals of operation, and it would be unrealistic to expect coverage in all subjects under the sun. The example query “political instability” brings a result set in hakia including concept matching.

5- Handling knowledge matching

Very similar to the previous item, a semantic search engine is expected to have embedded knowledge and use it to bring relevant results (swine flu = H1N1, flu=influenza.) Knowledge match and concept match are similar in principle, yet different in practice in the way the capability is acquired. The example query “swine flu virus” brings a result set in hakia where these kind of matches are visible.

[continued] hakia Blog » Blog Archive » 10 Things that Make Search a Semantic Search.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

Being Prepared For E-Discovery – Dashboard Insight

The legal discovery phase of litigation, where companies are required to produce a variety of historical documents and other knowledge that is relevant to a particular legal action, can often be the most important component of said litigation.  Failure to reign in the costs associated with legal discovery (and failures to comply with judicial mandates that require an organization to produce specific documents or correspondence) can result in dire consequences financially.  With the computing revolution of the last 20 years, the focus of legal discovery has alighted on electronically enabled and stored information – more specifically, the electronic discovery (E-discovery for short) of text, voice and video assets of interest.  Electronic information such as emails, instant messages, scanned documents and shared/networked spreadsheet and word processor files now make up the bulwark of legal discovery requests.

Vendors of business intelligence (BI) software have been rushing to market a number of E-discovery solutions that help companies save considerable sums of money when it comes to legal discovery and retention, as they greatly reduce the time spent on satisfying legal demands for documentation on corporate actions and processes.  One vendor of renown that has made great strides in the legal discovery support space is Riverglass Inc., a bleeding-edge vendor of business intelligence software which I have mentioned previously in Dashboard Insight.  Riverglass has identified four critical elements that are vital to the success of E-discovery.  They strategically embedded these elements in their software products; they are reach, relevance, repeatability and reliability.

  • Reach is all about knowing exactly where information resides and having the ability to retrieve that information, many times penetrating hidden pockets of data or deep-searching non-traditional data sources or silos of unstructured information.
  • Relevance means applying some sort of semantic or contextual meaning to information discovery and text searching in order to best support and streamline subsequent analytic tasks.  Deduplication and custom categorization and classification of data will be key drivers of ensuring relevance.
  • In order to achieve a suitable and sustainable return on investment, it is essential that E-discovery actions are highly repeatable—that automated discovery processes can run again and again over time and always produce the desired results with a modicum of variance.
  • Reliable.  E-discovery decrees a high echelon of transparency (of how data is indexed, searched and returned) and an extreme level of knowledge integrity for all search/discovery results.

Lawyers working on litigation cases always try their best to empathize with their clients and understand their core business values and processes.  But they will rarely have the know-how to best manage an E-discovery project from a technological perspective, especially when a legal request for information is not specific enough or there is a huge tome of information (much of it most likely redundant) that must be waded through in order to glean the small nuggets of data that are required to satisfy a discovery request.  Lawyers have never been cheap; they certainly will not become any less expensive in the future.  So it goes without saying that attorneys are probably not the best resources to be scouring corporate messaging applications, voice mails, personal digital assistants, employee laptops and network share areas looking for “needle in a haystack” information.

via Dashboard Insight – Being Prepared For E-Discovery – Page 1.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

The Need For Specialization Among the eDiscovery and Forensic Community « Litscovery

Individuals who are sick will schedule an appointment with a physician; those that face a legal issue will likely confer with an attorney.  The initial selection as to which physician or attorney to choose will be broadly based on the professional’s area of specialty.  Clearly, a criminal matter would best be handled by a criminal defense attorney while a divorce would be routed to a matrimonial lawyer.

This system of professional specialties has proven its effectiveness over the course of a number of decades and continues to be responsive to the growing needs of the present and future by adding, modifying, and removing specific specialties as needed.

Taking a look at the evolution of eDiscovery and computer forensics, it is clear that these fields remain defined very broadly.  Like medicine or law, IT is incredibly diverse and no single individual can be an expert in all facets of the field.  While eDiscovery and computer forensics are merely a small subset of IT, they too can and should be further broken down into more defined specialties.

Technical Knowledge

Although a heavy legal emphasis exists, the fact is that the eDiscovery and data forensics fields are branches of IT.  As such, having the proper technical knowledge to undertake an eDiscovery or data forensics project remains crucial.  Such knowledge may include experience with the particular operating system and hardware/server environment.

Much like one would likely not consult with a podiatrist about a neurological condition, a forensic specialist with analytical experience of end users’ Windows computers should not lead an examination of a mainframe server running a Unix flavored operating system.

Experience With the Subject Matter

It is widely known that eDiscovery and data forensics projects emerge as part of a multitude of legal issues, including administrative, civil, and criminal matters.  With data forensics in particular, the nature of the game is finding relevant data that often times can make or break the case.

The problem, therefore, becomes knowing what type of data to look for and where to look for it.  This will undoubtedly change depending on the type of case: a fraud related investigation will be significantly different from allegations of infidelity in a matrimonial matter.

Fraud is a very complex crime with deep intricacies that a simple keyword consisting of  “fraud” will not necessarily uncover.  In order to conduct a proper fraud analysis, one must have knowledge of the type of fraud alleged to have been perpetrated and the manner in which it took place.  This aids the examiner in understanding what steps may have been taken to facilitate such activity and where such data may reside on the drive.

via The Need For Specialization Among the eDiscovery and Forensic Community « Litscovery.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

Twitter as History – Library of Congress Signs Up – NYTimes.com

The Twitter fail whale error message.
Image via Wikipedia

Not everyone would think that the actor Ashton Kutcher’s Twitter musings on his daily doings constitute part of “the universal body of human knowledge.”

But the Library of Congress, the 210-year-old guardian of knowledge and cultural history, thinks so.

The library will archive the collected works of Twitter, the blogging service, whose users currently send a daily flood of 55 million messages, all that contain 140 or fewer characters.

Library officials explained the agreement as another step in the library’s embrace of digital media. Twitter, the Silicon Valley start-up, declared it “very exciting that tweets are becoming part of history.”

Academic researchers seem pleased as well. For hundreds of years, they say, the historical record has tended to be somewhat elitist because of its selectivity. In books, magazines and newspapers, they say, it is the prominent and the infamous who are written about most frequently.

But although celebrities like Mr. Kutcher may have the most followers on Twitter, they make up a tiny portion of its millions of users.

“This is an entirely new addition to the historical record, the second-by-second history of ordinary people,” said Fred R. Shapiro, associate librarian and lecturer at the Yale Law School.

via Twitter as History – Library of Congress Signs Up – NYTimes.com.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare

Yebol’s Semantic Search Provides Human Knowledge | CMS Wire

Search v. Discovery

Sometimes we search for information we know we’re looking for and for that we employ machine-generated knowledge systems, like Google. But sometimes we search to discover information we didn’t know about at all, like we do on Wikipedia, a human-generated knowledge system, that is, a system that is populated by user-generated content.

Discovering information is much more complex because it relies on the creation of conceptual relationships which helps to improve the accuracy of a search by understanding searcher intent. With semantic search, if you search for two or more terms, you will find occurrences of a conceptual relationship, not just the terms scattered within the same document, like traditional machine-generated search engines provide.

Searching and discovering are equally valuable, but understanding what you seek to gain from each can help you know where to go to get the information that you need.

A Multi-Dimensional User Interface

Approximately nine months ago, Yebol launched its beta version of its semantic search engine. This month as they plan to move out of beta, Yebol hopes that its advanced application of algorithms paired with human knowledge can provide the “first truly human-like world's knowledge base.”

Yebol’s multi-dimensional user interface features a categorical tree system, displaying a summary of top sites and categories about any given search term, while also visually displaying results matching user intent. The goal is to let users generate richer, more comprehensive search results displayed on a single page.

via Yebol’s Semantic Search Provides Human Knowledge.

LinkedInPinterestEvernoteWordPressBlogger PostEmailShare