StudentShare
Contact Us
Sign In / Sign Up for FREE
Search
Go to advanced search...
Free

Survey in Multimedia Data Mining by Content in Social Media - Literature review Example

Cite this document
Summary
"Survey in Multimedia Data Mining by Content in Social Media" paper has managed to illustrate one data mining technique that has been successful in the social multimedia domain. The advance of systems, in particular the web-based social systems, has heightened in recent years in an exponential manner  …
Download full paper File format: .doc, available for editing
GRAB THE BEST PAPER91.2% of users find it useful

Extract of sample "Survey in Multimedia Data Mining by Content in Social Media"

Survey in Multimedia Data Mining By Content in Social Media Name: Institute: Survey in Multimedia Data Mining By Content in Social Media Introduction The advance of systems, in particular the web-based social systems, has heightened in recent years in an exponential manner. Recently, academics and researchers started to examine a range of data mining techniques to assist experts enhance social media (Bhatt & Kankanhalli, 2011). Arguably, these techniques permit experts to discern novel information derived from users’ application data. Subsequent to this line, data mining has remained to be one of the most promising areas. Lately, a range of community services as well as Web-based sharing like YouTube and Flickr have made a massive and hastily mounting amount of multimedia content accessible online. Content uploaded by partakers in these vast content pools is escorted by wide-ranging forms of metadata, like descriptive textual data or social network information. Multimedia Data Mining: State-of-art According to Yu et al. (2009), these sets offer, without delay, new-fangled challenges and exhilarating chances for research in multimedia field. Basically, social multimedia offers a considerable prospect for Multimedia services and applications. Afar the level of accessible content, these services ensure novel context metadata as well as information in relation to the content is extensively accessible. Djeraba, Gabbouj, and Bouthemy (2006) posit that such data may entail scores of features: textual descriptors, data concerning the content capture location, the properties of camera’s metadata, and also user data as well as information in the social network. Arguably, these extra metadata can be utilized to develop and increase multimedia along with content examination methods. Additionally, social multimedia incarcerates and leverages society action in the region of multimedia data, through unambiguous user contribution such as comments and tags along with inherent participation from users (Naaman, 2012). Social Media Content Undeniably, social multimedia as well presents the prospect to design systems such as interactive systems, which bring out novel unambiguous and embedded metadata from user communication. Zhou et al. (2013) affirm that such communication and user participation is over and over again steered by community impulses and has the ability to enhance the information present for multimedia applications. Therefore, social multimedia presents a number of prospects that surpass and top other Web-based multimedia sources wherein lots of these prospects are not accessible. In spite of information scale and source, analysis of multimedia content still remains to be a complicated setback. Notably, Raducanu and Gatica-perez (2012) defined this semantic fissure as an inconsistency flanked by the data that can be mined from the available data as well as the construal that the similar data seizes for a user in a certain circumstance. Even up-to-date developments in computer vision have failed to resolve the semantic fissure setback; as a result, scores of open setbacks in multimedia cannot hitherto make suitable application of content analysis methods single-handedly (Si et al., 2012). Simultaneously, social media is absolutely not an enchantment medication, in particular bearing in mind that it is not free of its personal considerable challenges and limits. The above mentioned accessible and context metadata are raucous and frequently imprecise, erroneous or deceptive; consequently, there is insufficient ground certainty for social media data. Chrysostomou et al. (2009) posit that the noise and semantics shortage make also the trouble-free metadata, user-offered tags, complicated to employ. For instance, a Richard Mickey video tagged fails to reveal to us whom Richard Mickey’s tag refers to, and it may fail to portray any Richard Mickey in it at all. Additionally, the semantics shortage proves that there is no wrong and right in tagging: that Richard Mickey video could have been recorded on a journey to view a certain game park; or perhaps recorded from Richard’s house, but fails to depict Richard himself (in both instances, the tag still conveys a number of valuable significance for the user who dispensed it). According to Mulekar (2004), these matters of precision augment even prior to considering concerns of malicious and Spam content that attach additional challenges in public and open systems. Outstandingly, social multimedia mining and search involves change of focus from conventional multimedia applications. Essentially, the accessibility of content does not need broad-spectrum classification as well as detection undertakings (Teredesai et al., 2006). Critical Review of Data Mining The key basis that data mining has generated a lot of interest in the information sector in current years is because of the wide accessibility of enormous quantities of information and the looming desire for revolving such information into helpful data and facts (Raducanu & Gatica-perez, 2012). In essence, data and facts achieved can be employed for utilisations, which range from manufacture control, business administration, market examination, and research projects to science discovery and manufacturing design. Fundamentally, data mining can be observed as a product of the normal development of IT. The evolutionary course has been eye witnessed in the database industry in the advance of the functionalities such as information grouping and database formation, information administration (storage, extraction), and information examination and comprehension (data warehousing and mining) (Teredesai et al., 2006). Currently, data can be amassed in various distinct forms of databases. For instance, one of the database structural designs that have lately surfaced is the data warehouse, which is a store for manifold varied information sources, arranged under a united plan at one website so as to smooth the progress of management. According to Chrysostomou et al. (2009) data warehouse technology consist of online analytical processing (OLAP), data cleaning, and data combination, which are examination methods with features like consolidation, summarization, and aggregation, and the capacity to observe data from various viewpoints. Even though online analytical processing equipment brace multidimensional examination and management, supplementary information examination kit are needed for in detail examination, like data categorization, grouping, and the classification of information alteration over time (Djeraba et al., 2006). The large quantity of information attached with the demand for commanding data examination gear, has been portrayed as information loaded, but data deprived state of affairs. The finest rising, great quantity of data, gathered and amassed in enormous and various databases, has far surpassed the current individual capability for understanding with no commanding equipment. Subsequently, significant resolutions are time and again made established not on the data loaded information amassed in databases, but instead on a the educators perception, merely for the reason that the educators lack the apparatus to retrieve the costly information entrenched in the huge data quantity (Yu et al., 2009). Additionally, take into account the present professional system technologies, which characteristically depend on domain specialists or users to key in information by hand into information foundations. Regrettably, this process is vulnerable to prejudices and faults, and is exceedingly expensive and time consuming. In this regard, data mining gear carry out data examination and may possibly expose vital data models, contributing to a great extent to industry schemes, information foundations, and technical and medicinal research (Chrysostomou et al., 2009). Multimedia Data Mining Based on Social Media Content As mentioned above, social multimedia provides distinct channels for multimedia research, which entails: examining social actions around multimedia wherewithal; obtaining metadata from community resources and activity; and content polling in social environments. One latent advantage of social multimedia is the prospect to mass data or examine activities around entity wherewithal to well reason in relation to their content (Thuraisingham, 2007). For instance, Pahl (2004) make use of Instant Messenger’ chat activity to reason about the shared online videos content; Zhuhadar, Yang, and Lytras (2013) examine comments on Facebook videos to develop topics as well as interestingness; and Bhatt and Kankanhalli (2011) make use of social activity for web lectures’ social navigation. Recently, Zhou et al. (2013) utilized the Trends, volume and content of tweets in Twitter with regards to a multimedia broadcast to reason about the event content. In Bhatt and Kankanhalli (2011) work, they did not amass activity around a particular source, but instead utilized distinct techniques to pool items of the content collectively for the examination. Data intricacy and volumes remains to be information overload setback, and it is unfeasible to resolve the data overload problem by any means. This is for the reason that it takes well-built effort to make use of automatic and intelligent software paraphernalia for changing data that is rough into helpful information, which ultimately is changed to knowledge (Wu et al., 2010). Basically, Data mining remains to be part of the fundamental activities related with comprehending, navigating, and taking advantage of available digital data. According to Raducanu and Gatica-perez (2012), data mining is an automatic and intelligent procedure of discovering and recognizing helpful data structures like relations, models, and patterns. Mulekar (2004) views data mining as an element of the general information detection in data processes. Teredesai et al. (2006) describe data mining as a procedure of discerning a range of derived values, summaries, and models from a known data collection. In this regard, data mining must be an iterative as well as cautiously designed procedure of making use of correctly analytic methods to mine concealed, helpful information. Stage of the Data Mining Process The archetypal data mining process comprises a number of stages as well as the entire process is intrinsically iterative and interactive. Basically, there are six main phases of multimedia data mining process and they include: comprehension of the domain; selection of data; data preprocessing, cleaning transformation; patterns detection; analysis; as well as reporting and making use of the learnt knowledge. In comprehension of the domain phase it needs learning the way data-mining results can be employed in order to collect all pertinent aforementioned knowledge prior to mining (Bhatt & Kankanhalli, 2011). The selection of data phase needs the user to aim the database or choose fields’ subset or records of data to be utilized for multimedia data mining. The goal of preprocessing stage is to discover important features from raw data. According to Bhatt and Kankanhalli (2011), the preprocessing phase consists of data integration from distinct sources as well as resolving how to code or represent certain data domains that double as inputs in the phase of pattern discover. The phase of pattern discovery is arguably the centre of the whole data mining process, given that it is the phase wherein the veiled patterns, trends as well as relationships in the data are truly exposed. The phase of interpretation in the process of data mining is utilized to assess the discovery value and quality to establish if the preceding phases must be revisited or not. Finally, data mining process entails reporting and making use of the learnt knowledge to create new-fangled marketing strategies or actions (Bhatt & Kankanhalli, 2011). Multimedia Data Mining Challenges According to Zhou et al. (2013), data mining is vital as experts struggle to resolve data intricacy and overload problems. Owing to informational technology advances as well as high-functionality computing, enormous images sets like computer simulations images, digital sky surveys, digitalized or digital images, satellite images, medical images, and images created in loads of scientific domains are turning out to be accessible (Yu et al., 2009). The technique that handles the mining of inherent information, image information rapports, and extra patterns not unequivocally stockpiled in the image databases is recognized as image mining. The key problem of image mining is handling comparative information, inherent spatial data, and manifold construal of the equivalent visual patterns. According to Raducanu and Gatica-perez (2012), it is possible to take into account the well-designed the image-driven approach and application-oriented approach. Bhatt and Kankanhalli (2011) posit that the core intend of the multimedia data mining is to mine appealing information and comprehend semantics incarcerated in multimedia data that hold allied video, audio, images, and content. Multimedia databases having mixtures of diverse forms of data might be foremost incorporated through disseminated multimedia processors and afterward extracted, or one might use data mining paraphernalia on the databases that are homogenous and afterward integrate the outcomes of the different data miners (Djeraba et al., 2006). Solution to These Challenges To solve challenges in data mining, a user can make use of data warehouse, which provides a platform for effectual data extraction/mining. In essence, the data warehouse comprises: exhaustive and abridged information, integrated data, metadata, and historical data (Zhou et al., 2013). Arguably, all of these components improve the data extraction procedure and the views for achievement. Data marts, such as online analytical processing (OLAP) are an architectural annex of data warehouse for the reason that the mart have tailored data and runs on data at an extreme level of summarization. Data warehousing is a tactical enterprise and IT scheme in various organizations at the moment. A data warehouse is the notion of data mined from functional systems and made accessible as historical snapshots for impromptu queries and listed reporting. In essence, features that differentiates data in the data warehouse from that established in the functional setting is that it is: structured in a way that pertinent data is grouped mutually for unproblematic access (Wu et al., 2010). Furthermore, numerous copies of the data from different points over time are reserved jointly, and once the data is stored in the data warehouse it is not restructured. Instead, the historical snapshots stockpiled in the Data Warehouse are occasionally invigorated with data from the functional databases. According to Mulekar (2004), the key issues addressed by a data warehouse are that clients endure hard time generating ad-hoc or other focused reports and queries and. This is brought about by numerous aspects such as the majority of the data is stockpiled in Adaptable Data Base System (ADABAS), which is prickly for clients to access. In addition, the data stores were intended for business procedures and not ad-hoc reporting. Another aspect is that achieving the data often needs waiting for a programmer to either offer a tailored downloaded program or build up the report and at other times every data may not be reliable as of the same point in due course (Pahl, 2004). Conclusion In conclusion, it is apparent that we live in an exhilarating point in time for research in multimedia domain, as the social multimedia epoch heralds rapid transitions in the type and amount of accessible content, in the profundity and facets of the metadata, in the domains that make use of multimedia applications. Basically, these transitions justify call for setbacks that can influence the new-fangled trends, maybe over and above using an improved prospect to iterate on available multimedia setbacks. The article has managed to illustrate one data mining technique that has been successful in the social multimedia domain. References Read More
Cite this document
  • APA
  • MLA
  • CHICAGO
(Survey in Multimedia Data Mining by Content in Social Media Literature review, n.d.)
Survey in Multimedia Data Mining by Content in Social Media Literature review. https://studentshare.org/information-technology/2051308-survey-in-multimedia-data-mining-by-content-in-social-media
(Survey in Multimedia Data Mining by Content in Social Media Literature Review)
Survey in Multimedia Data Mining by Content in Social Media Literature Review. https://studentshare.org/information-technology/2051308-survey-in-multimedia-data-mining-by-content-in-social-media.
“Survey in Multimedia Data Mining by Content in Social Media Literature Review”. https://studentshare.org/information-technology/2051308-survey-in-multimedia-data-mining-by-content-in-social-media.
  • Cited: 0 times

CHECK THESE SAMPLES OF Survey in Multimedia Data Mining by Content in Social Media

Data Mining and Web Personalization

This paper ''data mining and Web Personalization'' tells that Web personalization is a business solution that is becoming more involved as technology advances and consumers can be more selective in how they pursue business relationships....             data mining as used in web personalization allows business owners to concentrate on solving many problems and making e-commerce more of a viable business option....
8 Pages (2000 words) Essay

Multimedia Project Management

ARE specialists also design and administer databases and data warehouses, analyse and implement enterprise-wide solutions to information problems and manage telecommunications efforts.... The database contains the data provided by the AIS.... Also, both data and information are entered from the environment.... This paper ''multimedia Project Management'' tells that Information Systems is serving the purpose of organizational and managerial needs by applying Information Technology....
7 Pages (1750 words) Essay

Data Mining Technologies

data mining is defined as: “a… 1).... It is not possible to give specific advice, but there are four general principles Running Head data mining data mining At the beginning of the 21st century, organizations depend upon information technology unsuccessful use of information systems management.... data mining is defined as: “a decision support process in which we search for patterns of information in data” (Pushpa 2007, p....
2 Pages (500 words) Essay

Data Mining Techniques

In the essay “data mining” the author analyzes different techniques to store large amounts of data inaccessible form.... hellip; The author states that data mining is a set of statistical techniques that aim to identify trends, patterns, and relationships in data.... data mining activities are of three types.... he three main drawbacks of using data mining are:Issues relating to the security of the information obtained by companies about individual customers....
2 Pages (500 words) Essay

Social Data in Context

It may be also as a result of the intermarriage between the ethnic groups and the Hispanics leading to their… Other factors may include occupational activities with the residents making them to settle and start reproducing. a) Median value is a set of measurements which takes the middle position when the measurements are arranged in the ascending order or social Data Question The percentage of Hispanics residing in California, Texas and Florida combined is 18.... It is used in measuring midpoint of a large set of data distribution which will latter reflect the central value of the distribution....
2 Pages (500 words) Assignment

Data Mining and Big Data

However, confusion reigns with many regarding Big Data is an over-hyped buzzword for data mining”, I agree with it. data mining entails the process of… On the other hand, the term, ‘Big data' describes a massive structured and unstructured data volume, which is so complex to process using the common or traditional 2 May Do you agree with this ment?... However, confusion reigns with many regarding Big Data is an over-hyped buzzword for data mining”, I agree with it....
2 Pages (500 words) Assignment

Application of Data Mining

data mining is used to generate revenues, cut costs, or get an insight… Recently, it has been revealed that the government has also been using an advance form of data mining where they are able to get the complete information about any individual.... This paper presents an analysis of data mining practices that are used by the government in order to secure or uncover people's data.... data mining has many advantages in different fields....
5 Pages (1250 words) Essay

Studying Human Mobility through Photograph Geo-Tagging

In the essay “Studying Human Mobility through Photograph Geo-Tagging” the author focuses on four major points: organization of georeferenced media sources; data mining of organized knowledge; identifying the world's landmarks and identification of the photo location, geographically.... With many people uploading photos in public and social sites, travel and tracking human mobility has been given a fresh look.... The first step in understanding the mobility of humans by the authors was to make use of a large amount of data available in the online community....
9 Pages (2250 words) Research Paper
sponsored ads
We use cookies to create the best experience for you. Keep on browsing if you are OK with that, or find out how to manage cookies.
Contact Us