Agile Computing: Blog Post

Using Taxonomy to Drive Online Contextual Advertising with Sophializer

Classifying Web Content to the IAB Taxonomy

It’s a Big Market …
…  online advertising.  There are 10,000 stories and data points about it.  Here are two to give some context to the journey below.  First, global online ad spending is projected by ZenithOptimedia to exceed print ad spend by 2015 (note 1).  This 2015 projected spend figure for online advertising is $132.4 billion.  Second, global online ad revenue is projected by another research agency, Digital TV Research, to hit $143 billion by 2017 (note 2).

These are prodigious amounts of money for companies to spend to connect with customers.  But … surely it’s easy to connect online customers to web content featuring, or suggesting, products? And surely, online is “better”?  Where can, and do, taxonomy-based approaches add value to this dance of moving (emotional and semantic) parts between the intentful consumer poised to shop and the intentful marketer with honed content?

Online Ad Targeting is Easy … so 'They' Say …
Really?  So what might be “easy”?  And, indeed, “better”?  Let’s unbundle these simulacra that look like very fuzzy concepts, and as ontologists and knowledge engineers let’s think our way forward with the concept of “precision”.

So … online is more precise than billboards by freeways?  Lightly stated, online has advantages.  What about magazine print ads vs. online?  Online has potential advantages. But … and this is a very big but … in both these cases (and all others) online depends on connecting potential customers to products, their features, their benefits, their attributes and so on precisely, and with precision that is repeatable and extensible.  Rather than random (random is the most expensive way to advertise and has fallen out of favor).  And, since online copy and online ads are words (including in videos) and are semantically classifiable, and since classifications can be organized into models (taxonomies and ontologies) … then there are advantages to be created through the combination of semantic analysis, categorization and taxonomy.

Now, let’s connect taxonomy, classification, semantics and optimizing online ad targeting.  There are a host of holy grails currently being sought in the web/mobile/social uber-ecosystem.  Some are well found, though not perfect, and are unlikely to see a paradigmatic improvement.  Think ‘search’.  Others are most definitely not found (yet).  Given the size of the market outlined in the first paragraph, the rewards are huge for those with the tools and skillsets to work with semantics, taxonomy/ontology, classification of content to taxonomy, and design of taxonomies to drive online targeting.

New Approaches to Classifying to the IAB Taxonomy with Sophializer
Sophia Search is a recent entrant into this space.  (I have written about them before here.)  Sophia Search’s tool – currently called the ‘Sophializer’ – categorizes any URL to nodes in the Interactive Advertising Bureau (IAB) taxonomy.  Sophializer can also classify the content of ads (and so create a semantic/conceptual ‘signature’ for each).  The IAB Contextual Taxonomy comprises three levels:

  • Tier 1 – 23 nodes
  • Tier 2 – 371 nodes
  • Tier 3 – unspecified and vendor specific

Given that Sophializer categorizes both sides of this content dance – web page and ad – web properties can serve ads to any page automatically using the IAB taxonomy as the cross-mapping conceptual foundation.
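To make the cross-mapping idea concrete, here is a minimal sketch in Python. It is not Sophializer's method, which is proprietary. It assumes only that some categorizer has already assigned both pages and ads to taxonomy nodes. The URLs, ad IDs, node labels and function names are hypothetical placeholders, and the labels are not verified IAB node names.

```python
from collections import defaultdict

# Hypothetical categorizer output: pages and ads, each mapped to taxonomy nodes.
# Node labels are illustrative placeholders, not verified IAB node names.
page_nodes = {
    "news.example/story-1": {"Travel", "Travel/Europe"},
    "news.example/story-2": {"Sports"},
}
ad_nodes = {
    "ad-guidebook": {"Travel", "Travel/Europe"},
    "ad-luggage": {"Travel"},
    "ad-running-shoes": {"Sports"},
}


def build_node_index(ad_nodes):
    """Invert the ad mapping: taxonomy node -> set of ads classified to it."""
    index = defaultdict(set)
    for ad_id, nodes in ad_nodes.items():
        for node in nodes:
            index[node].add(ad_id)
    return index


def rank_ads_for_page(page, page_nodes, index):
    """Rank ads by how many taxonomy nodes they share with the page."""
    scores = defaultdict(int)
    for node in page_nodes.get(page, ()):
        for ad_id in index.get(node, ()):
            scores[ad_id] += 1
    # Most shared nodes first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda ad: (-scores[ad], ad))


index = build_node_index(ad_nodes)
print(rank_ads_for_page("news.example/story-1", page_nodes, index))
```

Counting shared nodes is the simplest possible score. A production system would more likely weight deeper (Tier 2 or Tier 3) matches above Tier 1 matches, but that weighting is a design choice outside what this post describes.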

Sophializer not only classifies to Tier 1 and Tier 2; it also discovers/generates robust classifications that can be used to customize Tier 3 for individual customers.

Benefits of Using Taxonomy for Ad Targeting
Taxonomy gives a framework to this kind of semantic work.  Essentially, we are cross-mapping both partners of this content dance – content and ad – using the IAB taxonomy as a “choreographer” of sorts.  Other taxonomies could be used.  In fact, multiple taxonomies could be used – and this would be particularly powerful if these taxonomies were cross-mapped to each other.  For example, if you have content (web page, say, or ad) categorized and mapped to Taxonomy A, and Taxonomy A is cross-mapped to the IAB taxonomy … then … ads classified to the IAB taxonomy can be propagated to that already-categorized content.
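A cross-map between two taxonomies can be sketched as a simple lookup table. The example below is illustrative only: the "Taxonomy A" node names, the target labels and the `translate` helper are all assumptions made for this sketch, not part of any real taxonomy or tool.

```python
# Hypothetical cross-map from an in-house "Taxonomy A" to target-taxonomy nodes.
# All node names here are invented for illustration.
a_to_iab = {
    "A/Holidays": {"Travel"},
    "A/Football": {"Sports"},
    "A/Gardening": set(),  # no counterpart mapped yet
}


def translate(nodes_a, crossmap):
    """Translate Taxonomy A nodes to target nodes; report nodes with no mapping."""
    translated, unmapped = set(), set()
    for node in nodes_a:
        targets = crossmap.get(node)
        if targets:
            translated |= targets
        else:
            unmapped.add(node)
    return translated, unmapped


# Content already categorized to Taxonomy A picks up target-taxonomy nodes for free.
content_in_a = {"A/Holidays", "A/Gardening"}
iab_nodes, unmapped = translate(content_in_a, a_to_iab)
print(iab_nodes, unmapped)
```

Returning the unmapped nodes alongside the translated ones is deliberate: gaps in a cross-map are where content silently drops out of targeting, so they are worth surfacing.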

Benefits of Using Categorization Tools to Assign Marketing Content to Taxonomy Nodes
There are a number of different methods of assigning content to nodes in any taxonomy –

  • Manually
  • Training sets of documents (training documents are most often manually selected as exemplars)
  • Categorization algorithms that work with semantic tokens

There is more than enough to say on each of these around methods, workflows, best practices and pitfalls for a blog post on each.  But not here.

Sophializer utilizes patented and proprietary algorithms in the core of its categorization engine.  Two fundamental points are worth focusing on briefly.  Firstly, different categorization engines use different patented technologies.  “Quality” from different categorizers is (very) variable, which is why it is important to carry out “Proofs of Concept” when evaluating this technology.

Secondly, the more semantically rich the taxonomy – e.g. fully enriched with synonyms and other types of evidence terms – the better “quality” one gets with any method of associating content to taxonomy nodes.   Both of these parameters are make-or-break (literally) in using semantics to target online ads.
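The effect of evidence terms can be shown with a deliberately naive term-matching categorizer. This is a toy sketch, not how Sophializer or any commercial engine works. The node labels, evidence terms, `min_hits` threshold and function names are assumptions chosen for illustration.

```python
import re
from collections import Counter

# Hypothetical taxonomy nodes enriched with evidence terms (label plus synonyms).
evidence_terms = {
    "Travel": {"travel", "flight", "hotel", "holiday", "vacation"},
    "Sports": {"sport", "football", "soccer", "match", "league"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def categorize(text, evidence_terms, min_hits=2):
    """Score each node by evidence-term hits; keep nodes that reach min_hits."""
    counts = Counter(tokenize(text))
    scores = {}
    for node, terms in evidence_terms.items():
        hits = sum(counts[term] for term in terms)
        if hits >= min_hits:
            scores[node] = hits
    # Highest score first; ties broken by node name.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


text = "Cheap flight and hotel deals for your summer vacation"
print(categorize(text, evidence_terms))          # enriched taxonomy
print(categorize(text, {"Travel": {"travel"}}))  # label-only taxonomy
```

With the enriched node the text is assigned to the travel node. With only the bare label it is missed entirely, because the word "travel" never appears in the copy.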

Learn More 2.0
The Google Display Network is IAB Certified and complies with the top 2 tiers of the IAB Contextual Taxonomy.  You can read details of what Google does here; that page also leads to Google’s mapping to Tier 1 and Tier 2 of the IAB taxonomy.

Sophia Search currently has a number of engagements on the web that are live.  For example, targeting ads for non-fiction books (from a major publishing house) to news stories (on a pre-eminent news site).  You can contact them for details.

This is not an empty space.  Other companies are also searching for the holy grail of taxonomy-based content targeting mediated by content categorization that works.  See, for example, ADmantX (http://blog.admantx.com/post/15726823528/a-new-iab-based-taxonomy-and-an...).

This whole space is an excellent example of where the application of the nexus of taxonomy, categorization and semantics will provide stratospheric business benefit.  Grails are waiting to be found here.

Notes
Note 1.  See ZenithOptimedia

The detailed ZenithOptimedia figures can be found here

Note 2.  See Hollywood Reporter

You can download the Digital TV Research press release about these figures here
