Archive for the ‘Data Warehousing’ Category

Is MDM Dead?

Wednesday, March 3rd, 2010

By Mike Shultz, Infoglide Software CEO

Andrew White of Gartner recently posed a question about whether master data management (MDM) is dead. He didn’t actually suggest that the demise of master data management is imminent. He was challenging whether our current terminology adequately clarifies the current reality about MDM and associated product areas.

Certainly, terms that describe many distinct markets and product types are now being associated with MDM. Jackie Roberts of DATAForge pointed out that the definition of MDM now seems to include “data integrity, data quality, entity resolution, matching, data integration, governance, metrics and analysis.”

While entity resolution was mentioned in her list, our obsessive focus on entity resolution (aka identity resolution) leads to the conclusion that, rather than being subsumed, its role is growing. Wayne Eckerson at TDWI seems to agree that identity resolution is a critical component of the recent MDM acquisitions. In his post about the acquisitions by Informatica and IBM of Siperian and Initiate Systems, respectively, he described the two transactions this way:

“You could say that Siperian is mostly MDM, but with identity resolution and other capabilities, whereas Initiate is mostly about identity resolution, but with MDM and other capabilities.”

Identity resolution is becoming an integral part of many product areas. Within MDM itself, creating a single-entity view is best done with an identity resolution engine. Data mining is greatly enhanced by the addition of entity resolution. Dan Power of Hub Solution Designs wrote about how key identity resolution is to data matching. We’ve talked about how social CRM can resolve identities of individuals across multiple disparate data sources using identity resolution, as well as “rationalize multiple variations and errors and anomalies that block finding existing customers within their systems”.
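To make the idea of rationalizing "variations and errors" concrete, here is a minimal sketch of fuzzy name matching. It is only an illustration built on Python's standard `difflib` module, not a description of how any commercial identity resolution engine works. The function names and the 0.85 threshold are assumptions chosen for the example.

```python
from difflib import SequenceMatcher


def normalize(name):
    """Lowercase, drop periods and commas, and collapse repeated whitespace."""
    cleaned = name.lower().replace(".", " ").replace(",", " ")
    return " ".join(cleaned.split())


def similarity(a, b):
    """Return a 0.0-1.0 similarity ratio between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def same_entity(a, b, threshold=0.85):
    """Treat two names as the same entity when they are similar enough.

    The threshold is an arbitrary value for illustration; a real system
    would tune it and compare more attributes than the name alone.
    """
    return similarity(a, b) >= threshold


if __name__ == "__main__":
    for left, right in [("Jon Smith", "John Smith"),
                        ("ACME, Inc.", "acme inc"),
                        ("Jon Smith", "Mary Jones")]:
        print(left, "|", right, "->", same_entity(left, right))
```

A single string comparison like this is far cruder than entity resolution across multiple attributes and sources, but it shows why exact-match lookups miss existing customers whose records contain small variations.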

Although identity resolution technology has been years in the making, it has only recently risen into the consciousness of most analysts and customers. Because of its ability to bring enhanced clarity to ambiguous data, advanced identity resolution is now beginning to have a significant impact across many data-centered disciplines.

Identity Resolution Daily Links 2010-03-01

Monday, March 1st, 2010

By the Infoglide Team

IT-Director.com: The Informatica Event

[Philip Howard] “To begin with, the company talked about its acquisition of Siperian. I have already commented on this but one point that emerged at the conference was the way that Informatica describes Siperian as infrastructure MDM as opposed to application MDM. This is a hitherto unrecognised distinction (with respect to terminology) in the MDM market. Informatica distinguishes the former from the latter by saying that infrastructure MDM is domain and data model independent.”

Workforce Management: Medical Clinic Owners Plead No Contest to $60 Million Workers’ Compensation Fraud

“Investigators alleged that the pair purchased thousands of workers’ compensation client referrals from an attorney television advertising service. Clients were then sent to doctors who had a relationship with Premier, which would handle billing and collection work in return for a 50 percent fee for money they collected. Clients were then sent to attorneys who had a business relationship with Fish and Bacino, investigators allege. ‘Getting kickbacks for referring medical payments is illegal and drives up the costs in the system,’ California Insurance Commissioner Steve Poizner said in a statement.”

SignalScape: DC Police Chief Cathy Lanier Describes How Technology Is Changing Police Work in the Capitol

“The MPD also established a fusion center, which is responsible for the national capitol region. From a homeland security perspective, Chief Lanier said that the center collects and stores crime and terror alerts into a data warehouse.”

Injured Workers’ Law Firm Blog: Insurance Fraud Is a Huge Crime

“The fraudulent claims that can be made through insurance companies are categorized as being soft or hard. Soft fraud is the most common type of fraud and usually takes place when someone exaggerates a claim being made. Hard fraud takes place when someone deliberately plans a deceptive act such as a collision or the theft of their vehicle.”

Identity Resolution Daily Links 2010-02-13

Saturday, February 13th, 2010

[Post from Infoglide] Architectures for Entity Resolution

“In the last post we looked at a formal model for describing entity-based integration. Now let’s turn our attention to how entity resolution (ER) systems are actually implemented.  One of the most important design decisions is whether the system will perform entity identity management.  Systems perform identity management when they create and store the attributes values for the identities that they process.”

tdwi: IBM and Informatica Acquire MDM Capabilities

“The two acquisitions focus the spotlight on two of the hottest functions today, in terms of user organizations adopting them, namely: MDM and identity resolution. More than ever, organizations need trusted data, in support of regulatory reporting, compliance, business intelligence, analytics, operational excellence, and other data-driven requirements. MDM and identity resolution are key enablers for these requirements, so it’s no surprise that two leading vendors have chosen to acquire these at this time.”

PoliceGrantsHelp.com: Building fusion centers for the next decade

“Serrao says that in the time he has spent in a dozen different fusion centers in the United States — coupled with his own background in law enforcement — he’s gleaned several ‘best practices’ for consideration. Ideally, he says, leadership should ‘set a specific strategic mission before the center is even built. Everything else follows. Determine the role of the center and whether strategic intelligence analysis will be part of the mix. Then, it will be easier to define what processes will be developed, what reporting mechanisms are needed, what technology is appropriate, and what types of personnel are needed.’”

Prudent Press Agency: Kansas Takes Action Against Lottery Fraud

“The state of Kansas has been conducting sting operations to prevent this kind of theft by lottery terminal clerks. Law enforcement agents fanned out across the state and presented ‘winning’ tickets at several retail lottery outlets. In five separate cases clerks told the agents the tickets were worthless and then tried to redeem the ‘winning’ lottery tickets. The undercover investigation led to charges of attempted theft and computer crime against five people across the state.”

Identity Resolution Daily Links 2009-12-19

Saturday, December 19th, 2009

[Post from Infoglide] Data Fatigue

“Four years ago this week, a small aircraft lifted off from Watson Island in Miami. It was the plane’s 39,743rd flight. And as the tiny craft first vented white smoke and then lost its right wing in an explosion, it became clear that this was its last. All twenty people in the Grumman G73-T, including three infants, perished. The National Transportation Safety Board later determined that the culprit was metal fatigue.”

ovum: BI, EPM and EDW trends to watch out for in 2010

“For the mid-market and those new to BI, open source and BI software as a service (SaaS) will offer attractive alternatives. In the case of BI SaaS, increasing deployments of enterprise applications in the cloud by SMEs will act as a further driver for take-up of this option.”

destinationCRM.com: Electronic Health Records Get a Check-Up

“Hildreth references a 2009 New England Journal of Medicine survey indicating that close to 4 percent of physicians have a fully functional EHR system. About 13 percent of physicians’ offices have a basic EHR system in the works. Many organizations, Hildreth says, currently have bits and pieces of EHR, but not the full thing.”

insurancenewsnet.com: Hard-up Investigators Battle Against Rise In Comp Fraud

“While prosecution of various forms of insurance fraud is affected by budget constraints, the prosecution of underreporting of workers comp premiums by unscrupulous employers, or their outright failure to purchase the mandated coverage, may take the biggest hit, depending on each state’s priorities, Mr. Jay said.”

intelligent enterprise: Survey: BI Still Hindered By Technical Problems

“Specifically, the 2009 survey found that 29% of BI deployments were slightly successful and 47% were moderately successful. Only 21% of the respondents rated their deployments very successful. ‘A number of technical factors continue to contribute to — or hinder — stronger BI impact,’ the report said. ‘Data quality, reliability of the BI system and access to relevant data are the most important technical factors.’”

Identity Resolution Daily Links 2009-11-13

Friday, November 13th, 2009

[Post from Infoglide] The Big Story: Evolution

“Technology writer Chris Calnan’s story opened with a comment about Infoglide that nicely sums up the evolution of the broader market for identity resolution and entity analytics: ‘The market may have finally caught up with Infoglide Software Corp.’s technology.’”

OCDQ Blog: Beyond a “Single Version of the Truth”

“However, in his excellent book Data Driven: Profiting from Your Most Important Business Asset, Thomas Redman explains: ‘A fiendishly attractive concept is… ‘a single version of the truth’…the logic is compelling…unfortunately, there is no single version of the truth. For all important data, there are…too many uses, too many viewpoints, and too much nuance for a single version to have any hope of success. This does not imply malfeasance on anyone’s part; it is simply a fact of life. Getting everyone to work from a single version of the truth may be a noble goal, but it is better to call this the ‘one lie strategy’ than anything resembling truth.’”

RISK&INSURANCE: States of Disparity

“Risk & Insurance® looked at four factors that indicate how well a state’s workers’ comp system may be working. Those factors were adjusted by giving additional weight to the amount of premium charged to the employer, and the benefits paid to claimants. The states are ranked by their composite score.”

Security Management: DHS Official Outlines Federal Support to State-based Fusion Centers

“To better facilitate information sharing, Johnson promised DHS will deploy personnel to all fusion centers while giving fusion centers access to the Homeland Security Data Network by the end of fiscal year 2010. Currently, I&A has 44 field representatives based in fusion centers nationwide. I&A will also manage the newly created Joint Fusion Center-Program Management Office (JFC-PMO), which Napolitano tasked in October with coordinating how DHS’ various components and other federal agencies will support fusion centers.”

MedicExchange.com: EMR likely to boom throughout 2013

“Health IT currently is growing at an 11 percent annual rate, and solid growth should continue at least through 2013, which would be the third year of the federal EMR stimulus program here in the States, the Scientia report forecasts. In that time frame, health IT will increase its market share by a quarter, to 5 percent of global healthcare products sales from the current 4 percent.”

Identity Resolution Daily Links 2009-11-09

Monday, November 9th, 2009

By the Infoglide Team

NYTimes Dealbook: Insider Scheme Had Touches of James Bond

“Unlike the Galleon case, where senior officials at corporations passed tips on early earnings estimates to people at the fund, the Goffer case centers on allegations that may sound more familiar to students of the insider trading scandals of 25 years ago — early tips about deals from the people involved in doing them. According to the criminal complaints, Mr. Cutillo passed the information along through a friend, Jason C. Goldfarb, 31, who specialized in workers compensation law at a private firm in Brooklyn and who was also arrested on Thursday.”

Computerworld: Data quality vendors missing the mark, study finds

“One-fifth of respondents felt data quality is a prerequisite to an MDM initiative and wanted to see more vendor offerings integrating those two areas. Hayler says one would expect vendor partnerships between the areas of data quality and MDM, and that is precisely what is currently happening in the industry.”

docinthemachine: Encrypt EHR — Else HIPAA Violations Need Be Reported To Government & Media

“For example, if a physician maintains patient information in a laptop computer containing the unsecured information of more than 500 patients and the laptop is stolen, the physician would be required to notify not only the patients affected by the breach, but would likely need to also notify the DHHS and the media. A medical practice need not report a breach if the patient information has been properly encrypted – because information that is encrypted is not considered ‘unsecure.’”

Initiate Blog: The Brittle Nature of Data Warehouses

“Usually, only a small percentage of the data are ever used. So why bother? The TCO for extracting, copying, converting, transferring, transforming, integrating, propagating, backing-up, loading, and verifying the data skyrockets far beyond its value and injects significant risk and brittleness into the entire ecosystem.”

Identity Resolution Daily Links 2009-11-02

Monday, November 2nd, 2009

By the Infoglide Team

Come by and see us at TDWI World in Orlando Nov. 3 & 4, Booth 405

The Emculturated World: Unmanage Master Data Management

“MDM breaks down in the moment it becomes divorced from a practical, immediate attempt to capture just what is needed today. The moment it attempts to “bank” standard symbols ahead of their usage, the MDM process becomes speculative, and proscriptive.”

Governing: Can I Say No to an Electronic Health Record?

“In some instances, patients don’t even know their information is being shared. For example, if consumers turn over prescription drug records when applying for life insurance, the insurer will sometimes hand off the information to business partners who then hand it off to data miners. To keep a tighter grip on privacy, Deven McGraw, director of health privacy at the Center for Democracy and Technology, would like a set of rules that all organizations in the health IT world would have to follow.”

Related post: “Applying Identity Resolution to Patient Identification Integrity”

San Antonio Express-News: McManus recalls 9-11 at GEOINT summit

“Bart Johnson, acting undersecretary for intelligence and analysis with the Homeland Security Department, said cooperation is improving, although problems remain with security clearances and interdepartmental connectivity. ‘The federal government can only do so much in getting it down to the street level,’ Johnson said. Homeland security and Justice Department officials have formed 72 “fusion centers” — terrorism prevention and response centers where federal agencies work with the military, local law enforcement and private partners. Three are in Texas: Austin, Dallas and Collin County near Dallas.”

information management: From Search to Explore

“It’s no surprise that people are looking at more and more internal and external resources for informed decision-making. In the internal case, data integration is a foundation of master data management as well. But integration for BI to common visual tools is increasingly taking place in subsystems, relational databases and cubes, and the visualization layer itself.”

Identity Resolution Daily Links 2009-10-26

Monday, October 26th, 2009

By the Infoglide Team

Come by and see us at TDWI World in Orlando Nov. 3 & 4!

Forbes.com: Who Is In Charge Of Your Data?

[Dan Woods] “But in most companies, no single person is charged with the task of making sure that the right data is being captured in an efficient way that ensures data quality. The Data Warehousing Institute estimated the annual cost of poor data quality at $600 billion in 2002. Other studies have produced similar estimates.”

Austin American Statesman: Clerk accused of absconding with lottery cash

“So when the 25-year-old quit his job at the convenience store and claimed a $1 million lottery jackpot in Austin, Joshi’s co-workers were suspicious and told investigators, the affidavit said. Those investigators now believe that in May, after a regular customer brought in his lottery tickets and asked Joshi to check if they were winners, Joshi kept the winning ticket, did not tell the customer and claimed the prize for himself, according to the affidavit and Travis County Assistant District Attorney Patty Robertson.”

Hartford Business: State Recommits To Fighting Shadow Labor

“The state board charged with cracking down on employers who fail to pay employee taxes and workers’ compensation premiums will meet on Nov. 5, following a 10-month hiatus.”

cnet news: Gartner: Brace yourself for cloud computing

“Cloud computing takes several forms, from the nuts and bolts of Amazon Web Services to the more finished foundation of Google App Engine to the full-on application of Salesforce.com. Companies should figure out what if any of those approaches are most suited to their challenges, Gartner said.”

Privacy – A Dying Concept?

Wednesday, October 7th, 2009

By Gary Seeger, Infoglide Vice President

An intriguing post by Nate Anderson on Ars Technica highlights a difficult reality about today’s easy availability of vast quantities of “anonymized” data. Quoting from a recent paper by Paul Ohm at the University of Colorado Law School, Anderson writes that “as Ohm notes, this illustrates a central reality of data collection: ‘data can either be useful or perfectly anonymous but never both.’”

A seminal study published in 2000 by Latanya Sweeney at Carnegie Mellon opened the issue by proving that a simple combination of a very small number of publicly available attributes can uniquely identify individuals:

“It was found that 87% (216 million of 248 million) of the population in the United States had reported characteristics that likely made them unique based only on {5-digit ZIP, gender, date of birth}. About half of the U.S. population (132 million of 248 million or 53%) are likely to be uniquely identified by only {place, gender, date of birth}, where place is basically the city, town, or municipality in which the person resides… In general, few characteristics are needed to uniquely identify a person.”
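The kind of measurement behind those figures can be sketched in a few lines. The example below is a toy illustration, not Sweeney's actual methodology or data: it counts what share of records is unique on a chosen combination of attributes. The field names and the four sample records are invented for the example.

```python
from collections import Counter


def unique_fraction(records, keys):
    """Return the share of records whose combination of `keys` occurs exactly once."""
    def combo(record):
        return tuple(record[k] for k in keys)

    counts = Counter(combo(r) for r in records)
    unique = sum(1 for r in records if counts[combo(r)] == 1)
    return unique / len(records)


# Hypothetical sample data, made up purely for illustration.
people = [
    {"zip": "78701", "gender": "F", "dob": "1970-01-02"},
    {"zip": "78701", "gender": "F", "dob": "1970-01-02"},
    {"zip": "78701", "gender": "M", "dob": "1970-01-02"},
    {"zip": "78702", "gender": "F", "dob": "1985-06-30"},
]

if __name__ == "__main__":
    print(unique_fraction(people, ["gender"]))
    print(unique_fraction(people, ["zip", "gender", "dob"]))
```

In this toy data, adding attributes to the combination raises the share of records that stand alone, which is the effect the study reports at national scale.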

Faced with a choice between exploiting easily obtainable data for legitimate ends and guarding against the misuse of data that identifies individuals, can privacy legislation strike an appropriate balance? Anderson points out that:

“Because most data privacy laws focus on restricting personally identifiable information (PII), most data privacy laws need to be rethought. And there won’t be any magic bullet; the measures that are taken will increase privacy or reduce the utility of data, but there will be no way to guarantee maximal usefulness and maximal privacy at the same time.”

Looking at the subject from a business perspective, using technologies such as identity resolution to connect non-obvious data relationships serves many initiatives. It would seem admirable to exploit public records and other forms of publicly available information to mitigate risks, uncover fraud, or track down “bad” guys. Yet some cry foul when the technology exposes individuals who didn’t anticipate that their “private” information would be used to identify and/or track them down.

In the rapidly evolving cyber-information age, the desires, conflicts, and limitations of protecting privacy will continue to be sorted out in the legal realm. Those of us who solve business issues using identity resolution technology will swim in this legal quagmire for many years. Finding an appropriate balance between the protection of individual privacy and bona fide business uses of “public” data will almost certainly be a growing challenge to the moral and legal minds of our community.

To Move or Not to Move: That is the Question

Wednesday, September 30th, 2009

By Robert Barker, Infoglide Senior VP & Chief Marketing Officer

A continual theme at IdentityResolutionDaily is maintaining the privacy and confidentiality of data at all times. Two recent posts concerned fusion centers and citizen profiling, but the same issues apply to virtually any application of entity resolution technology. In fact, the more sensitive implementations sometimes require anonymous identity resolution.

For the last decade or so, the strong emphasis in data management has been on implementing data warehouses, data marts, and master data management. When bundled with associated processes like data extraction, transformation, and cleansing, these methods have been widely accepted as the best approach to solving any data problem. Here at IdentityResolutionDaily, we tend to talk about this over-handling of data as “data deterioration.”

A more basic approach is simply working with data sources undisturbed in their native environments. New principles suggest that you should perform scoring analyses as close to the source as possible. By exploiting existing security layers already in place, the need to add new layers of security is obviated.

Of course, for key sources of operational data, existing IT policies may deny direct access. In other cases, it may be necessary or preferable to move data for other reasons. For example, achieving desired performance parameters may dictate working with an extracted subset of the data rather than the entire data store.

The point I’m making is not to forbid moving data or creating data marts under any circumstances. Rather, I’m suggesting that the most rational approach is the following:

  1. Develop solutions that adapt easily to multiple, disparate, remote data sources.
  2. Default to leaving data where it lives whenever and wherever possible.
  3. Provide the appropriate levels of entity anonymity within the solution and with the least possible intrusion to the enterprise.
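
One way to picture points 2 and 3 together is sketched below. Each source computes keyed hashes of its identifiers locally, and only those opaque tokens are compared, so raw values never leave the system where they live. This is an assumption-laden illustration using Python's standard `hmac` and `hashlib` modules, not a description of any particular product. The function names, the `name` field, and the shared secret are hypothetical.

```python
import hashlib
import hmac


def tokenize(value, secret):
    """Return a keyed SHA-256 token for a lightly normalized identifier."""
    normalized = " ".join(value.lower().split())
    return hmac.new(secret, normalized.encode("utf-8"), hashlib.sha256).hexdigest()


def local_tokens(records, field, secret):
    """Run at the data source: turn one field of each record into a token."""
    return {tokenize(r[field], secret) for r in records}


def shared_tokens(tokens_a, tokens_b):
    """Run centrally: find entities present in both sources without seeing raw values."""
    return tokens_a & tokens_b


if __name__ == "__main__":
    secret = b"example-shared-secret"  # hypothetical key agreed on by both sources
    source_a = local_tokens([{"name": "Jane  Doe"}, {"name": "Bob Ray"}], "name", secret)
    source_b = local_tokens([{"name": "jane doe"}, {"name": "Ann Lee"}], "name", secret)
    print(len(shared_tokens(source_a, source_b)))
```

Note the trade-off: hashed tokens only match when the normalized values are identical, so this sketch gives up the tolerance for variations and errors that similarity scoring provides.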
