JOURNAL OF COMPUTING, VOLUME 4, ISSUE 2, FEBRUARY 2012, ISSN 2151-9617 https://sites.google.com/site/journalofcomputing WWW.JOURNALOFCOMPUTING.ORG

Developing Integrated Astronomy News Information System based on Curatorial Knowledge and Content Management
Y. Chen, M. Urata, K. Mouri and T. Yasuda
Abstract: This research responds to the growing interest in developing more effective techniques for the automatic collection and knowledge editing of astronomy news. We have developed an information management system for Japanese astronomy news, termed “Astronomy News Explorer” (ANE). Curators often wonder whether news from research institutions can be gathered efficiently and how past news can be obtained more conveniently. Because the key issue is how to make immediate use of associated news, an easy editing and browsing tool is becoming necessary. The ANE is therefore designed to help refine raw data and to encourage the re-use of content through a curatorial content management system (CCMS) that structures news management activities for science museum curators. In addition, we propose news management and news package features that, by reconstructing news, give astronomy curators different viewpoints and a better overview of teaching materials. The aim is to widen knowledge of astronomy expertise and to support news management as a platform that makes information accessible through knowledge sharing on the Internet, so that curators can more conveniently harvest new research achievements and exchange astronomical teaching experience with other curators.
Index Terms: Information Visualization, Content Management System (CMS), Time and Space, Data Management, Astronomy Curator


1 INTRODUCTION

Curators of a museum are content specialists who are responsible for an institution’s collection and are involved in the interpretation of museum material. Specifically, the Nagoya City Science Museum houses one of Japan’s largest planetariums in its astronomy section and has many sections on modern technology. The astronomy curator who works there has the primary function of being a subject specialist, with the expectation that he or she will conduct original research on astronomical results and guide the organization on new astronomy information. For this reason, astronomy curators usually need to use astronomy news websites to obtain up-to-date news. The search functions of existing astronomy websites and news systems only support a single time period or a single keyword. If we want to gather news from different national research institutions, we must visit each institution’s website separately to read all the available news, which is inconvenient and time-consuming.

Our proposed scheme is not a simple aggregation of all news; it aims to optimize every step toward the final goal. Firstly, because timeline and spatial views are among the fastest ways for human beings to take in information, we adopt visualization to support interaction between the users and our system. The visualization technique allows our system to present more information to the users at the same time than the traditional news sites adopted by other exploration systems, such as classified date and keyword based search (Fig. 1). Secondly, to fully utilize the curator’s knowledge and specialty, we provide a convenient environment for characterizing astronomy knowledge and news according to different interests. On the one hand, it becomes easier to detect and evaluate news information, so our platform for content analysis and information architecture of news can achieve higher performance. On the other hand, more accurate classification of the news enables more effective visualization. As a result, our proposed platform may achieve better performance than simply aggregating and listing as much news as possible.

Editing news data in an information management system gives great benefits. For example, systems commonly exhibit a systematic visualization across the characteristics of each item, which requires controls that are managed against the knowledge background of the astronomy curators. To handle this effectively, an information management system suited to the amount of news is required. In this paper, we have developed multiple applications to address the following problems:
(a) How to extract news contents from large scale news collections that may have huge diversities of visual properties and format?
(b) How to visualize large scale related astronomy news (i.e., news topics and their groups) on a limited size screen?
(c) How to edit news topics and their relationships to conform to the astronomy curator’s needs?
(d) How to assign astronomy topics and keywords to the news clearly?
(e) How to achieve synchronization between news management and display?

Y. Chen is with the Department of Systems and Social Informatics, Nagoya University, Japan, 4068601.
M. Urata is with the Graduate School of International Development, Nagoya University, Japan, 4068601.
K. Mouri is with the Nagoya Science Museum, Japan, 4648601.
T. Yasuda is with the Department of Systems and Social Informatics, Nagoya University, Japan, 4068601.


Fig. 1. The system architecture of the ANE

Section 2 provides related work on using data browsing technologies to align different news sources or information. In Section 3, we elaborate on the news browsing applications created in the ANE and discuss the evaluation of our application functions by astronomy curators who work in the Nagoya City Science Museum. In Section 4, a layered approach is presented that builds upon and combines formal representations of existing browsing systems, together with the management service built around our system and the new features of news management and news data reconstruction in the ANE. Section 5 discusses which problems we solved. Finally, conclusions are drawn in Section 6.

2 BACKGROUND
To bridge the user’s needs and large scale data, visualization is widely used to help users explore large amounts of information and find interesting parts interactively. Visualizing published news documents by classification can facilitate understanding of event trends. For visualization systems, some researchers use keywords, time, and space as the dimensions of the visualization interface (Table 1). Many applications for news visualization have been developed, and researchers already apply subjective measurements to accompany objective results. The visual news interface represents a compromise between the range of visualization techniques and the interactive application for news websites. Subsequently, usability studies provided approximations of how the systems would respond in slightly more realistic task settings with people who more realistically represented the target users [1].

Nomata (2007) proposed a novel visualization system for news articles that supports exploring, observing, and supplying visual summaries of news articles [2]. Matsumoto (2005) proposed a multi-channel dissemination system with a time dependent filter and an application technique for time-series documents on the Internet. He also adopted a push-based application method based on confidence and scoop levels to describe a prototype system [3]. The Wall Chart of World History (1998) displays the reigns of rulers of the major countries in the world along with commentary about those reigns and important historical events [4]. Visually, the timeline makes extensive use of color, shape and scale, and a few other familiar paper timelines show cultural and scientific advances [5]. In-spire has been developed for visualizing text document collections, where statistics of news reports are put on a world map to show the heat relations among the regions [6],[7]. TimeMine is proposed to detect the most important reports and organize them along a timeline [8]. The newsmap organizes news topics from Google News in a rectangle, where each news story covers a visualization space that is proportional to the number of related news pages reported by Google. News titles are drawn in the visualization space allocated to the relevant news topic [9].

When large scale news collections come into view, the available information can be very large, and displaying all of it to the users may mislead them. To address this problem, some existing news visualization systems also disclose different distribution structures of news, but such distribution structures may not make sense to the users because they may not fit the users’ real needs [10]. Therefore, to incorporate the classification of news topics for more effective visualization and exploration of news collections, several researchers use a predefined ontology to assist visual content analysis and retrieval [11],[12],[13].

Event relations are particularly important. Viewing event relations between news items with timelines has some similarities to viewing postulated relationships between propositions in hypertext argumentation systems [14],[15]. Time Manager supports interactive exploration of spatial data on the Internet over time. It has been integrated into several web applications, such as viewing dynamic changes in forest areas, demonstrating the movement of white storks in Africa, visualizing earthquake occurrences over time and showing time variations of thematic data. Collocations of date and place were displayed on timelines and maps. More detailed displays could highlight key names and phrases associated with a selected event [16]. ThemeRiver can visualize a large collection of documents with keywords or themes. It represents the distribution structure of themes and keywords in the collection and depicts thematic variations within a large collection of documents over time. A river metaphor is used to visualize a document collection’s timeline, with the selected thematic content and thematic strength indicated by the river’s directed flow, composition and changing width [17],[18].

Established in 1969, ESRI is one of the leading companies providing software solutions based on geographic information systems technology. It offers a comprehensive set of nationwide data and services that include street data, high resolution imagery, weather data, topographic data and flood data. ESRI’s software helps organizations understand customer needs, analyze site locations, visualize and map demographic data and identify market trends (http://www.esri.com/) [19]. In the Perseus Digital Library Project, historical events with date and place information extracted from unstructured text have been detected and visualized [20]. The approach is more applicable to visualizing historical events than emerging events, which are seldom well covered in published documents.

On the other hand, to enhance the power of visual features for various news concepts, the Informedia Digital Video Library project at CMU has achieved significant progress in analyzing, indexing and searching news collections [21]. Several applications have been reported, such as keyword-based news retrieval and query result visualization. The DELOS project has also made significant progress in multiple areas, including information access and personalization, object detection and visualization [22],[23],[24]. From this research, we can specifically take news changes, news recording times, news title keywords, and spatial representations as visual features; all of them are suitable, advantageous tools that can be displayed in visualization news interfaces. Arranging data according to their relations has been widely used in other visual retrieval applications for the web. Rather than recommending the most interesting news topics to the users, all of these existing news visualization systems prefer to disclose all the available information, and thus the users have to dig out the interesting information by themselves.

TABLE 1
CASES OF COMMON NEWS VISUALIZATION SYSTEM

Researches and descriptions
Nomata (2007), the observation and the supplying of visual summaries of news articles.
Matsumoto (2005), an application technique for time-series documents on the Internet.
Deacon (1998), commentary on important historical events.
Mehler (2006), news reports are put on a world map among the regions.
Swan (2000), the most important reports through a timeline.
Weskamp (2000), a news story covers a visualization space that is proportional to the number of related news pages.
ThemeRiver (Havre et al., 2002), the vertical distance, or width, of the river indicates the collective strength of the selected themes. It can show the history of an event collection.
ESRI’s ArcWeb (http://www.esri.com/), uses a map view to display street data, weather data, topographic data and flood data.
Perseus DL’s map of top events (Smith, 2002), involves detecting and visualizing documents with date and place information.
Time Manager (Andrienko et al., 2000), a set of tools that provide animation of dynamic changes of events over geographical locations and time.

Dimensions T S K E
× × × ○ ○ × × × ○ × × ○ × ○ × × ○ × × × × ○ × ○ × × × ○ × ○ ○ ○ ○ ○ ○ ○
T=Time, S=Space, K=Keyword, E=Event

3 THE EVALUATION OF THE ANE VISUALIZATION
In March 2011, following the needs of astronomy curators, we built a system that we call “Astronomy News Explorer” (ANE). It shows the indication and classification of all heavenly bodies accurately and guides the user to operate the GUI intuitively, as shown in Fig. 2. The system aims to provide a news browsing interface for astronomy curators. Elements of the system can be combined to reflect the specifics of the curator’s request. Visual time and spatial interfaces are expected to outperform list interfaces for task types such as nonspecific queries, because they make use of additional semantic information (i.e., place and date). We designed the user interface screens to reflect the curators’ perspectives and assessed the curators’ perceptions of the usability of the prototype screens. This system should enhance the quality of news browsing with respect to activities performed with maladaptive structure [25],[26]. In this phase, an evaluation result was needed, and four astronomy curators were recruited to participate in a formative evaluation of the prototype. In October 2011, we performed a new experiment on the ANE visualization. We used structured interview questions to facilitate the brainstorming.


Fig. 2. The application of news classification in the ANE

A questionnaire was used as the subjective measure for the study. It comprised items concerning perceived ease of use, perceived usefulness, and intention to use. None of the users had used our system before. The protocol was approved by the astronomy news, and we provided curators with a system introduction and asked them to perform news reading and news searching. They used the prototype of the ANE over the Internet (http://133.6.221.32/04/Astronomy.html). The news was selected for relevance to the news browsing application that we developed. We applied a graphic-based and curator-centered conception to elicit the curators’ perceptions of the functional requirements for interactive astronomy news browsing. We have three main tasks. The first task is to find which heavenly body produced the largest number of news occurrences during 2009. Therefore, a user may read more than one news item in the time and space applications to complete this task. The second task is to compare the amount of news events over the past 10 years. The third task is to find the classification and the number of shuttle news items from NASA in 2009. Before the task, curators mentioned that they usually use astronomy websites or Twitter, which can link to a site for reading news directly. For instance, they frequently use the site “Star Information” (http://news.localgroup.jp/) [27], which records daily astronomical information from famous astronomy institutions. Another site is “APOD” (http://apod.nasa.gov/apod/) [28], a website that presents daily pictures concerning astronomy news or astronomical history. When curators use astronomy news sites, it is easy to get new information immediately, but it is still difficult to know all the news from the present back to the past.
However, there are so many newspapers and magazines that it is hard to read all the news data every day, although curators really need these data to assist their educational work and as research materials. The curators commented that the presentation of common news websites is not yet good enough. These websites disregard the idea of combining the time at which news occurs with its location in space. The problem with existing systems is the difficulty of grasping the relations among all astronomy news; when searching, the only way is to enter a keyword such as a month or a day, which finally returns a single news item. Reading and searching for news therefore takes a lot of time, and traditional searching is very inconvenient for curators.

After the ANE experiment, curators mentioned that this system displays the concepts of "Classification", "Time" and "Space" for astronomy news and uses these classifications to present news such as that from NASA’s website. The system not only displays news titles but also classifies them by keywords. It also directly presents the time button, the space button, the main subject classifications and the position of the heavenly body in place of the typical selection list (Table 2). Curators want a system that can show news or new information from institutions in different countries, not limited to one area but including other regions. The numbers of news items published by the various astronomy research institutions need to be compared to meet the curators’ needs. Moreover, given the news distribution and its usefulness for study, what are the balance and direction of the curators’ next research? It is significant to emphasize the use of a visualization system, which will be more convenient and efficient for curators.

Based on the above, we can find that:
(1) Our system provides a valuable news service when curators do not have a detailed preference.
(2) Sophisticated time and space based search techniques perform better when the users have a detailed preference and need to learn the fine details.
(3) Our system is able to guide the users to build their own knowledge effectively and efficiently.
(4) Only keyword based search techniques can be adopted to disclose fine details after the system catches the user’s fine preference for making educational programs.

TABLE 2
ADVANTAGES AND DISADVANTAGES OF ANE

Advantages (opinions of curators)
○ I can understand the quantities of news and it is very useful.
○ It is very interesting to find what news differ between planets.
○ The keyword list is a useful feature for searching.
○ We can compare the quantities of interesting news; this is very convenient.
○ It is a useful tool for news statistical procedures.

Disadvantages (opinions of curators)
× I can’t figure out the “read and unread” news.
× I wonder if the keyword addition and register features are also needed.
× It is difficult to define every astronomy keyword but it is necessary to manage them.
× It is better to provide some functions for individual news management.
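The three evaluation tasks described in this section are, in essence, simple aggregations over classified news records. The following is a minimal sketch of that idea in Python, not the ANE implementation: the record layout (year, heavenly body, source, subject) and all sample values are hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical sample records: (year, heavenly body, source, subject).
NEWS = [
    (2009, "Moon", "NASA", "shuttle"),
    (2009, "Mars", "NASA", "probe"),
    (2009, "Moon", "NAOJ", "eclipse"),
    (2008, "Sun", "NAOJ", "eclipse"),
    (2009, "Moon", "NASA", "shuttle"),
]


def top_body(news, year):
    """Task 1: the heavenly body with the most news items in a given year."""
    counts = Counter(body for y, body, _, _ in news if y == year)
    return counts.most_common(1)[0] if counts else None


def items_per_year(news):
    """Task 2: the number of news items per year, ordered by year."""
    return dict(sorted(Counter(y for y, _, _, _ in news).items()))


def subject_count(news, year, source, subject):
    """Task 3: the number of items for one subject, source and year."""
    return sum(
        1 for y, _, src, subj in news if (y, src, subj) == (year, source, subject)
    )


print(top_body(NEWS, 2009))
print(items_per_year(NEWS))
print(subject_count(NEWS, 2009, "NASA", "shuttle"))
```

In a visual interface the same counts would drive the size or position of the displayed elements instead of being printed.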

3.1 Problems of ANE
Most existing tools for astronomy news collection, as well as the first version of ANE, still suffer from the following problems. The first problem is how to manage the astronomy news from the database and bridge the different knowledge classifications. Most existing systems can only support extraction of all news and display them on the same screen [29],[30],[31],[32],[33],[34]. Because of the huge diversity of visual properties and semantics in news, most existing tools for news classification cannot be directly extended to achieve automatic analysis and exploration of news collections appropriately [35],[36],[37],[38],[39]. The second problem is how to edit the most significant news topics from all news collections. This problem is becoming more critical for the following reasons: (a) the number of news topics could be very large because there are so many public astronomy institutions; (b) each news item may have different importance; (c) each curator may have his/her own interest in the same news topic, and there is no existing approach that addresses the expert’s interests and needs. Most existing techniques for news classification work well on the same keyword domains within the constraints of news website contents. However, none of them can effectively classify the huge diversity of news properties according to the curator’s knowledge.

4 THE SIGNIFICANCE OF CONCEPT OF CURATORIAL CONTENT MANAGEMENT SYSTEM WITH ANE

In general, content management systems are software applications for creating, publishing, editing and managing content. They are widely used by news and media organizations, e-commerce websites, libraries, the broadcasting and film industry, and educational institutions to handle content efficiently [40],[41]. The content used by a CMS is mostly stored in a content repository, which is a hierarchical content store with support for structured and unstructured data. As the primary role of a CMS is to organize content items to make them accessible through intuitive queries, metadata assignment mechanisms are an important feature of a CMS. Moreover, metadata has never had such an important impact on the capability of CMSs to manage, retrieve, and describe content [42],[43]. Nowadays the user is becoming the producer of content, and there is a need to manage this personal content as well, hence the introduction of a Content Management System.

Astronomical knowledge, educational employment and curatorial needs would probably lead to different interpretations of the curator’s requirements in this system. There is a standardization effort to develop metadata that “aim at being pedagogically neutral or unspecific to theories and systems” [44],[45]. However, this approach can be questioned because the curator’s situations are specific. Thus, the support of several metadata approaches can be seen as a positive feature of our Curatorial Content Management System (CCMS). In this case, the astronomy curator creates, annotates, organizes and shares the curatorial news content. We introduce an approach to building a CCMS in which a complete metadata construction is created. It is not easy to present significant news topics to users efficiently and effectively. A news collection may carry a huge amount of information, yet it is very difficult to exchange information between the user and the computer.

Based on the above observations, we have developed a new framework of ANE with news collection and management. We propose (1) new data capture, (2) data edit, (3) news package and (4) reading history recording in the CCMS (Fig. 3). Choosing CCMS as the platform for content management means that ANE does not need to reinvent the content management wheel but can use suitable parts of the CCMS that are already available, and it allows easy integration of the news data with the other editing features of the CCMS. For example, the ANE management consists of editors and views. Editors follow the manage/package pattern, and many instances of the same editor can be open at the same time, whereas views reside around the editors and show different information for the selected resource. The result of this process is an integrated tool, preferably hosted on the same web server, with an appearance that can be modified for each user through its own user manager. In addition, the appearance of the content can be changed completely by creating a new template, which covers the need of the ANE management to adapt to the overall environment of news browsing at any moment.

4.1 NEWS CAPTURE
The database is a dynamic storage system, which implies a complex structure of data with a logical system that recovers files quickly through simple search patterns [46]. Gaio deals with interactions between the primary source of data and any type of final user [47]. Furthermore, the database contains a base format catalog, which can be built upon to add news data to the relational database; the database is implemented in MySQL, and the Spring Framework is used to leverage communication with MySQL [48],[49]. Therefore, for data programs, the closed captions and contents can provide abundant information, and such information can be used to detect news topics of interest rather than processing whole contents to obtain the news topics [50],[51],[52],[53],[54].

In this research, the outline of the database is structured into two large sets of data. This inventory has been completed with searches in current astronomy news data from the news websites of AstroArts (http://www.astroarts.co.jp/rss/update-j.xml) and the National Astronomical Observatory of Japan (http://www.nao.ac.jp/rss.rdf). The news set contains basic information about each inventoried news element and allows this data to be re-managed within the editor of this system. Firstly, the system reads the RSS feeds of Japanese astronomy news into Yahoo Pipes. We use the pipes to assemble the information relating to the astronomy description of the RSS, general descriptive messages, and the news title and news contents included in other news data. After that, we publish the edited RSS results and put them into the ANE database automatically, together with other RSS feeds dealing with news every day. Therefore, any user can consult the database or receive results by interacting with the DBMS.

Any news browsing system has two fundamental bases: data storage and the Database Management System (DBMS), which saves data while providing fast and structured access to it, giving correct management of the news. A necessary condition for this database is that there is a standard, which in this case uses the CMS architecture. For the DBMS, MySQL has been chosen, as it is currently one of the most powerful tools, together with MySQL Server. It is multiplatform (Linux, Windows, Mac) and has certain essential characteristics, such as support for creating applications in multiple programming languages (C++, PHP, among others). The structure of the database allows fast, logical and standardized access to the data, which holds the title, date, content and URL of every news item. In this way, complete reports can be generated.
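The capture step described in this subsection (read an RSS feed, keep the title, date, content and URL of each item, and store new items in a relational database) can be sketched as follows. This is a minimal illustration under stated assumptions rather than the ANE code: it parses a small hypothetical RSS 2.0 fragment instead of downloading a real feed, it uses Python's built-in SQLite module as a stand-in for MySQL, and the table name `news` and the function names are invented for the example. Feeds in RDF-based RSS 1.0 use XML namespaces and would need additional handling.

```python
import sqlite3
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Hypothetical RSS 2.0 fragment; a real feed would be downloaded instead.
SAMPLE_RSS = """<rss version="2.0"><channel>
  <title>Sample astronomy feed</title>
  <item>
    <title>Total solar eclipse observed</title>
    <link>http://example.org/news/1</link>
    <pubDate>Wed, 22 Jul 2009 09:00:00 +0900</pubDate>
    <description>Observation report from several sites.</description>
  </item>
  <item>
    <title>New image of a black hole candidate</title>
    <link>http://example.org/news/2</link>
    <pubDate>Mon, 03 Aug 2009 12:00:00 +0900</pubDate>
    <description>Short summary of the release.</description>
  </item>
</channel></rss>"""


def parse_items(rss_text):
    """Yield one dict per <item> with the four fields named in the text."""
    root = ET.fromstring(rss_text)
    for item in root.iter("item"):
        published = parsedate_to_datetime(item.findtext("pubDate"))
        yield {
            "title": item.findtext("title", default="").strip(),
            "date": published.date().isoformat(),
            "content": item.findtext("description", default="").strip(),
            "url": item.findtext("link", default="").strip(),
        }


def store(conn, items):
    """Insert items, skipping URLs already stored; return the rows added."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS news ("
        "url TEXT PRIMARY KEY, title TEXT, date TEXT, content TEXT)"
    )
    before = conn.total_changes
    conn.executemany(
        "INSERT OR IGNORE INTO news (url, title, date, content) "
        "VALUES (:url, :title, :date, :content)",
        list(items),
    )
    conn.commit()
    return conn.total_changes - before


conn = sqlite3.connect(":memory:")
print(store(conn, parse_items(SAMPLE_RSS)))  # first run stores both items
print(store(conn, parse_items(SAMPLE_RSS)))  # second run finds nothing new
```

Using the URL as the primary key is one simple way to keep a daily scheduled import from storing the same news item twice.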

Fig. 3. The architecture of news management in the ANE

4.2 NEWS MANAGEMENT
Users of news websites or applications require a mature function to obtain news smoothly and simply. We want to solve existing problems in traditional news collection systems, such as browsers that only display news but offer no function for management. The subject used to classify astronomy objects should be chosen carefully, because it determines the clarity of the connections between objects in existing astronomy news. Thus, the problem of collaboration between news content and curators is not yet settled. The ANE provides a frame for bridging different news sources, and the astronomical proper nouns depend on the experts gathered to define them as the curator demands. The second subset deals with requirements arising from the curator’s educational and research needs, which make up the classification element in the management.

ANE’s management interface consists mainly of the management explorer, a table view for browsing resources and a number of editors used for editing and visualizing news contents. Computer representations of news are created using an interactive process, and a complete list of news resources is visible in the management explorer. The news in the ANE database can be reached via the management interface through the scripting language PHP. ANE provides a built-in help system in which the user can either browse, much as one browses the web, or open an edit view that shows context-specific assistance for what the user wants.

The idea that the metadata should be governed by a standard arose to avoid problems derived from the definitions provided for each news data (Fig. 3). Therefore, it may seem contradictory to use specific vocabulary for a specific field of astronomical knowledge. The task is to include management describing the specific news, or any other useful metadata, within the databases of the major news search, thereby achieving better search optimization. We divide the information relating to the description of the news browsing system into a series of tags known as Meta-Tags, which hold information on “Date”, “Title”, “Description”, “Link”, “Language”, “Country”, “Source”, “Keyword” and “Subject”. The Meta-Tags follow the classification and the vertical character of the astronomy expertise above. The editor manages the news-related information, including the acquisition and reproduction of information for a collaboration application that focuses on managing information transport between data and database. The system instances could be deployed automatically, providing a browsing interface that is well defined between those components.
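The Meta-Tags listed above can be pictured as one record per news item, with the “Keyword” and “Subject” tags filled in from a curator-defined vocabulary. The sketch below illustrates this under explicit assumptions: the class name `NewsMeta`, the function `assign_keywords`, the vocabulary entries and the simple substring matching are all hypothetical and are not taken from the ANE implementation.

```python
from dataclasses import dataclass, field

# Hypothetical curator-defined vocabulary: keyword -> subject.
VOCABULARY = {
    "solar eclipse": "Sun",
    "black hole": "Deep sky",
    "project apollo": "Moon",
}


@dataclass
class NewsMeta:
    """One news item described by the nine Meta-Tags named in the text."""
    date: str
    title: str
    description: str
    link: str
    language: str
    country: str
    source: str
    keywords: list = field(default_factory=list)
    subjects: list = field(default_factory=list)


def assign_keywords(meta, vocabulary=VOCABULARY):
    """Fill the keyword and subject tags by case-insensitive matching."""
    text = f"{meta.title} {meta.description}".lower()
    for keyword, subject in vocabulary.items():
        if keyword in text:
            if keyword not in meta.keywords:
                meta.keywords.append(keyword)
            if subject not in meta.subjects:
                meta.subjects.append(subject)
    return meta


item = NewsMeta(
    date="2009-07-22",
    title="Project Apollo photos and a solar eclipse",
    description="A hypothetical item used only for illustration.",
    link="http://example.org/news/1",
    language="en",
    country="JP",
    source="Example feed",
)
print(assign_keywords(item).keywords)
print(item.subjects)
```

Keeping the vocabulary as data rather than code matches the idea that curators, not programmers, define and maintain the astronomy keywords.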


Fig. 4. The architecture and applications of news package editor

The system coordination server would be started first and would load the news data. Then, each news data would retrieve the connection list to the application to be established. The management of the news browsing system is necessary for these services to construct news classification, and of course this information is contained in the metadata, written in the database.

4.3 NEWS PACKAGE
A news package feature is assigned to each news topic through curator analysis, and this management is further used to decide the importance of the relevant news groups so that less important news topics can be filtered out. The associations among the news groups are extracted to achieve more effective visualization and interactive exploration of news collections. Once raw data has been imported, the news package can be used to view the results for news groups. The news package has multiple tabs in which data can be viewed in different ways. By using the proper tabs of the news editor, news wells can be marked as specialized, effectively removing them from further catalogues. To help the user identify these definitions, the catalogue functions can be viewed. For example, when a news package is created with markers indicating wells used for functions such as groups of different subjects, all wells with those markers are created by curators; alternatively, when doing primary screening, the user can write a new classification for new projects or new events.

The relations among different news packages are also very important for enabling more effective retrieval and exploration of news collections. Because the news packages have already been extracted, the relations among them can be extracted further according to their co-occurrence probabilities (Fig. 4). Based on this observation, a news package relation network is used for knowledge interpretation. In those text sentences, the names of news reporters are generally not separated from the content of the news report. They are not appropriate for news semantics interpretation and should be removed. Therefore, using the news package function, the curator uses news topics (i.e., keywords of heavenly bodies) as nodes and their relations as edges, and the edges are determined according to their interestingness for other curators. Curators take advantage of the news contents and achieve automatic news catalog detection from news closed captions. In news documents, some special text keywords, such as “solar eclipse”, “black hole” and “Project Apollo”, can be processed separately. A visualization classification is incorporated to give users a better global overview of the news information, so that curators can explore useful news materials interactively with each other.
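The relation network described above, with news topics as nodes and co-occurrence relations as edges, can be sketched as follows. The text does not define the co-occurrence probability, so this sketch assumes one plausible reading: the fraction of news items in which two topics appear together. The function name `cooccurrence_edges`, the `min_prob` threshold and the sample items are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(items, min_prob=0.0):
    """Return {(topic_a, topic_b): probability} for topics sharing a news item.

    Assumption: probability = (items containing both topics) / (all items).
    Each key is a pair of topic names in sorted order, so every undirected
    edge is stored once.
    """
    total = len(items)
    if total == 0:
        return {}
    pair_counts = Counter()
    for topics in items:
        for a, b in combinations(sorted(set(topics)), 2):
            pair_counts[(a, b)] += 1
    return {
        pair: count / total
        for pair, count in pair_counts.items()
        if count / total >= min_prob
    }


# Hypothetical news items, each reduced to its set of topic keywords.
SAMPLE_ITEMS = [
    {"solar eclipse", "Sun"},
    {"solar eclipse", "Moon", "Sun"},
    {"black hole"},
    {"Project Apollo", "Moon"},
]

if __name__ == "__main__":
    edges = cooccurrence_edges(SAMPLE_ITEMS)
    for (a, b), prob in sorted(edges.items(), key=lambda e: -e[1]):
        print(f"{a} -- {b}: {prob:.2f}")
```

Raising `min_prob` drops weak edges, which is one simple way to keep the network readable on a screen of limited size.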

5 DISCUSSION
Increasing news production has led to increasing complexity of news organization, management and, hence, understanding. Classifying news articles and presentations in relation to other information will become increasingly difficult to do in a meaningful way that helps the reader understand the full nature of the news or the information [25,26]. In this paper, we have developed a novel framework to achieve more effective analysis, retrieval and exploration of news collections and news management. ANE is provided as a dedicated editing system for astronomy news; by integrating news information from multiple sources with the results of news content analysis, our proposed schemes can achieve more effective news content detection and curator assignment and connect them with news management. A curator edit area contains the astronomy knowledge and the graphical visualization management. To address these concepts, this paper describes a news collection and management system that facilitates five objectives:
a. Allow people to extract news contents from large-scale news collections, which may be highly diverse in visual properties and format, through Yahoo Pipes and cron programming.
b. Combine the effective aspects of both filtering and browsing with the ability to visualize large-scale astronomy news relations on a screen of limited size.
c. Facilitate understanding of the relationships between independently authored news items, conforming to the astronomy curator’s needs.
d. Organize disconnected articles into manageable forms based on astronomy topics and keywords.
e. Allow quick packaging of related information and synchronization between news management and display.
The retrieved stories can be organized by timeline or space so that users can easily learn the history of the whole event.
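Organizing retrieved stories by timeline or by space can be sketched in a few lines. This is an illustrative sketch only: it assumes each story is a record with a “date” and a “country” field, and the function names `by_timeline` and `by_space` and the sample stories are hypothetical.

```python
import datetime
from collections import defaultdict

# Hypothetical stories; titles, dates and countries are placeholders.
STORIES = [
    {"title": "Follow-up report", "date": datetime.date(2012, 1, 20), "country": "Japan"},
    {"title": "First report", "date": datetime.date(2012, 1, 10), "country": "Japan"},
    {"title": "Overseas report", "date": datetime.date(2012, 1, 15), "country": "USA"},
]


def by_timeline(stories):
    """Return the stories ordered from oldest to newest."""
    return sorted(stories, key=lambda story: story["date"])


def by_space(stories):
    """Group the stories by country, keeping timeline order inside each group."""
    groups = defaultdict(list)
    for story in by_timeline(stories):
        groups[story["country"]].append(story)
    return dict(groups)


if __name__ == "__main__":
    for story in by_timeline(STORIES):
        print(story["date"], story["country"], story["title"])
```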
Our visualization of the news relation network gives users a good global view of the overall information in the news collections at first glance, so that users can specify their queries visually, because the relevant keywords for news topic interpretation and the most representative key frames are visible. Our framework allows users to start from any of the related news topics and navigate towards more detail by clicking the relevant news of interest to change the focus. This is a key concept justifying its use in this study, as the aim is to find a structured way of bringing the information from news websites into database system platforms without the need for large-scale changes in the structure of its dissemination. In addition to such specific queries, the most relevant web news can also be retrieved. This feature is very important for audiences who want to know more details and read relevant reports about an event, and our system thus provides an efficient way of answering such needs.

6 CONCLUSION

A news collection may carry a huge amount of information; however, it is very difficult to exchange that information between the user and the computer. Because vision is the fastest way for human beings to take in information, supporting news visualization is one potential solution for exploring the relations and significance of news. The problem we want to solve is therefore how to transform the data in response to user input so as to implement user-adaptive exploration. The ANE fully utilizes the user’s visual capability: the adopted visualization technique adapts to the curator’s needs and presents the information carried in large-scale news collections through interactive exploration [25,26]. For administrative management, we created an instance of the CCMS that includes news management and news packaging functions in charge of different astronomy news sources. These extensions improve the overall performance of our previous work. Beforehand, we consulted a curator about a system interface for astronomical expertise that structures collaboration according to the curator’s own knowledge. The extensions also allow a system designer to match information management to a user’s requirements. In this system, Yahoo Pipes items for news content are first extracted automatically from raw news by using the analysis technique. The catalog (e.g., news title, date) for news topic interpretation and the contents (e.g., abstract or news URL) are extracted from the closed captions. Secondly, the group measurements for the astronomy keyword items (i.e., space, time, and classification) are assigned simultaneously via a management panel. Finally, a novel visualization framework is incorporated to visualize news package relations and support interactive news exploration, so that curators can better understand the extent of the relations among news and make better research decisions. In this paper, we developed automatic news extraction and relation classification, introduced a new approach, and proposed a news editing system that performs curator-adaptive visualization.
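The extraction of the catalog (title, date) and contents (abstract, URL) can be illustrated with a short sketch. It does not reproduce the Yahoo Pipes and cron setup used by ANE; instead it assumes the aggregated feed is available as an RSS 2.0 document and parses it with the Python standard library. The function name `extract_items` and the sample feed are hypothetical.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Hypothetical RSS 2.0 feed standing in for the aggregated news output.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Hypothetical astronomy feed</title>
    <item>
      <title>Placeholder headline A</title>
      <link>http://example.org/news/a</link>
      <description>Placeholder abstract A.</description>
      <pubDate>Tue, 10 Jan 2012 09:00:00 +0900</pubDate>
    </item>
    <item>
      <title>Placeholder headline B</title>
      <link>http://example.org/news/b</link>
      <description>Placeholder abstract B.</description>
      <pubDate>Fri, 20 Jan 2012 18:30:00 +0900</pubDate>
    </item>
  </channel>
</rss>"""


def extract_items(feed_xml):
    """Return one dict per <item> with title, date, abstract and url.

    Missing elements become empty strings (or None for the date).
    """
    root = ET.fromstring(feed_xml)
    records = []
    for item in root.iter("item"):
        pub_date = item.findtext("pubDate")
        records.append({
            "title": (item.findtext("title") or "").strip(),
            "date": parsedate_to_datetime(pub_date).date() if pub_date else None,
            "abstract": (item.findtext("description") or "").strip(),
            "url": (item.findtext("link") or "").strip(),
        })
    return records


if __name__ == "__main__":
    for record in extract_items(SAMPLE_FEED):
        print(record["date"], record["title"], record["url"])
```

In a periodic collection setup, a scheduled job could call a function of this kind on each source feed and write the resulting records to the database; the scheduling itself is outside this sketch.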
The news management addresses the existing problems in a beneficial way, since the technical implementations of users and tools can be modified without any impact on their description in the system; i.e., technical changes have no effect on how the system is operated for astronomical expertise. The ANE supports the exchange of news between experts by interfacing their respective competences, and their independence is reflected in the difference between user requirement and association. This difference defines a relationship that guarantees a certain level of influence for all experts. We created a complete technical and multimedia database dealing with the astronomy news of websites in Japan (AstroArts and the National Astronomical Observatory of Japan) and searched for a coherent means of interaction between the results of different elements of the news. In addition, users can communicate with other curators by providing feedback, and such feedback can further be integrated to improve their teaching analysis and education assignment. They could put forward the teaching material tools that support their educational activities. The aim is to contribute to widening knowledge of astronomy expertise by supporting news browsing as a platform that makes information accessible through knowledge sharing on the Internet, one of the most powerful channels of information.

ACKNOWLEDGMENT
This research was sponsored in part by the Ministry of Education, Culture, Sports, Science and Technology, Japanese Government, under a Grant-in-Aid for Scientific Research.

REFERENCES
[1] C.M. Karat, “Cost-benefit Analysis of Iterative Usability Testing,” Interact '90, Proceedings of the IFIP TC 13 Third International Conference on Human-Computer Interaction, 1990.
[2] Y. Nomata, J. Hoshino, “Visualization for Information Exploration and Observation of News,” IPSJ SIG Technical Report, Information Processing Society of Japan, pp. 53–57, 2007 (in Japanese).
[3] K. Matsumoto, K. Sumiya, K. Uehara, “A Multi-channel Dissemination System Based on Time-Series, Clustering Mechanism for On-Line News,” Technical Report of IEICE, NII-Electronic Library Service, Japan, pp. 137–144, 2005 (in Japanese).
[4] C.W. Deacon, “The Wall Chart of World History,” Bracken Books, London, (revised 1998).
[5] C. Scarre, “Smithsonian Timelines of the Ancient World,” Dorling Kindersley Inc, New York, 1993.
[6] A. Mehler, Y. Bao, X. Li, Y. Wang, S. Skiena, “Spatial analysis of news sources,” IEEE Trans. Visualization Comput. Graph. 12, pp. 765–772, 2006.
[7] J.A. Wise, J.J. Thomas, K. Pennock, D. Lantrip, M. Pottier, A. Schur, V. Crow, “Visualizing the non-visual: spatial analysis and interaction with information from text documents,” IEEE Symposium on Information Visualization, pp. 51–58, 1995.
[8] R. Swan, D. Jensen, “Timemines: constructing timelines with statistical models of word,” ACM SIGKDD, pp. 73–80, 2000.
[9] M. Weskamp, “Newsmap,” http://www.marumushi.com/apps/newsmap/index.cfm.
[10] J.J. van Wijk, “Bridging the gaps,” Comput. Graph. Appl. 26 (6), pp. 6–9, 2006.
[11] S. Dasiopoulou, C. Doulaverakis, V. Mezaris, I. Kompatsiaris, M.G. Strintzis, “An ontology-based framework for semantic image analysis and retrieval,” Semantic-Based Visual Information Retrieval, Idea Group Inc., 2007.
[12] S. Dasiopoulou, C. Saathoff, Ph. Mylonas, Y. Avrithis, Y. Kompatsiaris, S. Staab, M.G. Strintzis, “Introducing context and reasoning in visual content analysis: an ontology-based framework,” Semantic Multimedia and Ontologies: Theory and Applications, Springer, Berlin, 2007.
[13] D. Vallet, P. Castells, M. Fernandez, P. Mylona, Y. Avrithis, “Personalized content retrieval in context using ontological knowledge,” IEEE Trans. Circuits Systems Video Technol. 17 (3), pp. 336–346, 2007.
[14] J. Conklin, M.L. Begeman, “A hypertext tool for exploratory policy discussion,” ACM Transactions on Information Systems 6, pp. 303–331, 1988.
[15] N. Streitz, et al., “SEPIA: A cooperative hypermedia authoring environment,” Proceedings of ACM Hypertext, pp. 11–22, 1992.
[16] N. Andrienko, G. Andrienko, P. Gatalsky, “Visualization of spatio-temporal information in the Internet,” Proceedings of the 11th International Workshop on Database and Expert Systems Applications, IEEE Computer Society, Greenwich, London, UK, pp. 577–585, 2000.
[17] S. Havre, B. Hetzler, L. Nowell, “Themeriver: visualizing theme changes over time,” IEEE Symposium on Information Visualization, pp. 115–123, 2000.
[18] S. Havre, E. Hetzler, P. Whitney, L. Nowell, “ThemeRiver: visualizing thematic changes in large document collections,” IEEE Transactions on Visualization and Computer Graphics 8, pp. 9–20, 2002.
[19] “ESRI,” http://www.esri.com/.
[20] D.A. Smith, “Detecting events with date and place information in unstructured text,” Proceedings of the Second ACM/IEEE CS Joint Conference on Digital Libraries, ACM Press, Portland, OR, USA, pp. 191–196, 2002.
[21] A.G. Hauptmann, “Lessons for the future from a decade of informedia video analysis research,” International Conference on Image and Video Retrieval (CIVR), Lecture Notes in Computer Science, vol. 3568, Springer, Berlin, pp. 1–10, 2005.
[22] G. Antini, S. Berretti, A. Del Bimbo, P. Pala, “3d face identification based on arrangement of salient wrinkles,” Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2006), pp. 85–88, 2006.
[23] H. Bruce, B. Cleal, R. Fidel, A.M. Pejtersen, “A multidimensional approach to the study of human–information interaction: a case study of collaborative information retrieval,” J. Amer. Soc. Inform. Sci. Technol. 55, pp. 939–953, 2004.
[24] S. Christodoulakis, C. Tsinaraki, “A multimedia user preference model that supports semantics and its application to mpeg,” Proceedings of the Multimedia Modeling 2006 Conference (MMM 2006), pp. 35–42, 2006.
[25] Y. Chen, M. Kondo, K. Mouri, T. Yasuda, “Astronomy News Element Analysis and Visualization System for the web,” The Fifth International Conference on Innovative Computing, Information and Control Processing, Taiwan, pp. 153–157, 2009.
[26] Y. Chen, M. Kondo, K. Mouri, T. Yasuda, “A Visualization System of Astronomy News Based on Time and Space Mechanism,” The Japan Association for Social Informatics Processing, The Japan Society for Socio-Information Studies, Nagasaki, pp. 183–188, 2010.
[27] “Star Information,” http://news.local-group.jp/.
[28] “APOD,” http://apod.nasa.gov/apod/.
[29] B. Adams, C. Dorai, S. Venkatesh, “Towards automatic extraction of expressive elements from motion pictures: tempo,” IEEE Trans. Multimedia 4, pp. 472–481, 2002.
[30] W.H. Adams, G. Iyengar, C. Lin, M. Naphade, C. Neti, H. Nock, J.R. Smith, “Semantic indexing of multimedia content using visual, audio and text cues,” EURASIP J. Appl. Signal Processing, pp. 170–185, 2003.
[31] K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei, M.I. Jordan, “Matching words and pictures,” J. Machine Learn. Res. 3, pp. 1107–1135, 2003.
[32] J. Maydt, R. Lienhart, “An extended set of haar-like features for rapid object detection,” Proceedings of the International Conference on Image Processing (ICIP 2002), vol. 1, pp. 900–903, 2002.
[33] M.R. Naphade, I.V. Kozintsev, T.S. Huang, “Factor graph framework for semantic video indexing,” IEEE Trans. Circuits Systems Video Technol. 12, pp. 40–52, 2002.
[34] C.G.M. Snoek, M. Worring, J.-M. Geusebroek, D.C. Koelma, F.J. Seinstra, A.W.M. Smeulders, “The semantic pathfinder: using an authoring metaphor for generic multimedia indexing,” IEEE Trans. Pattern Anal. Machine Intell. 28, pp. 1678–1689, 2006.
[35] C.G.M. Snoek, M. Worring, A.G. Hauptmann, “Learning rich semantics from news video archives by style analysis,” ACM Trans. Multimedia Comput. Comm. Appl. 2, pp. 91–108, 2006.
[36] G. Sudhir, J.C.M. Lee, A.K. Jain, “Automatic classification of tennis video for high-level content-based retrieval,” CAIVD ’98, 1998.
[37] Y. Wu, E.Y. Chang, K.C.-C. Chang, J.R. Smith, “Optimal multimodal fusion for multimedia data analysis,” ACM Multimedia, 2004.
[38] W. Zhou, A. Vellaikal, C.C. Jay Kuo, “Rule-based video classification system for basketball video indexing,” ACM Multimedia, 2000.
[39] D. Bulgarelli, C. Grana, R. Vezzani, R. Cucchiara, “A semiautomatic video annotation tool with mpeg-7 content collections,” Proceedings of IEEE International Symposium on Multimedia (ISM2006), pp. 742–745, 2006.
[40] S. Ottosson, L. Holmdahl, “Web-based virtual reality,” J. Eng. Design 18, pp. 103–111, 2007.
[41] J. Becking, S. Course, G. van Enk, H.T. Hangyi, J.J. Lahaye, M. Ockeloen, D.R. Peters, H. Rosbergen, R.V. de Joode, “MMBase: an open-source content management system,” IBM Syst. J. 44, pp. 381–397, 2005.
[42] H. Allert, C. Richter, W. Nejdl, “Extending the Scope of the Current Discussion on Metadata 2. Towards Situated Models,” in B. Wasson, S. Ludvigsen, U. Hoppe (Eds.), Designing for Change, Netherlands: Kluwer Academic Publishers, pp. 353–362, 2003.
[43] M. Gaeta, F. Orciuoli, P. Ritrovato, “Advanced ontology management system for personalised e-learning,” Knowledge-Based Systems, pp. 292–301, 2009.
[44] W.P. Eveland Jr., K. Marton, M. Seo, “Moving beyond ‘just the facts’: the influence of online news on the content and structure of public affairs knowledge,” Communication Research, vol. 31, no. 1, pp. 82–108, February 2004.
[45] S. Sunder, S. Kalyanaraman, J. Brown, “Explicating web site interactivity: impression formation effects in political campaign sites,” Communication Research, vol. 30, no. 1, pp. 30–59, 2003.
[46] S. Liu, C.A. McMahon, M.J. Darlington, S.J. Culley, P.J. Wild, “A computational framework for retrieval of document fragments based on decomposition schemes in engineering information management,” Adv. Eng. Inf. 20, pp. 401–413, 2006.
[47] M. Gaio, C. Sallaberry, P. Etcheverry, C. Marquesuzaa, J. Lesbegueries, “A global process to access documents contents from a geographical point of view,” Comput. 19, pp. 3–23, 2008.
[48] “MySQL homepage,” http://www.mysql.com/.
[49] “Spring application framework,” http://www.springsource.org.
[50] B. Adams, C. Dorai, S. Venkatesh, “Towards automatic extraction of expressive elements from motion pictures: tempo,” IEEE Trans. Multimedia 4, pp. 472–481, 2002.
[51] W.H. Adams, G. Iyengar, C. Lin, M. Naphade, C. Neti, H. Nock, J.R. Smith, “Semantic indexing of multimedia content using visual, audio and text cues,” EURASIP J. Appl. Signal Processing, pp. 170–185, 2003.
[52] K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei, M.I. Jordan, “Matching words and pictures,” J. Machine Learn. Res. 3, pp. 1107–1135, 2003.
[53] J. Maydt, R. Lienhart, “An extended set of haar-like features for rapid object detection,” Proceedings of the International Conference on Image Processing (ICIP 2002), vol. 1, pp. 900–903, 2002.
[54] M.R. Naphade, I.V. Kozintsev, T.S. Huang, “Factor graph framework for semantic video indexing,” IEEE Trans. Circuits Systems Video Technol. 12, pp. 40–52, 2002.

Y. Chen is a Ph.D. candidate at the School of Information Science, Informatics and Sciences at the University of Nagoya. She studies news-facilitated knowledge management integration for astronomy teaching material and its application in supporting information use. Her main research interests include museum supporting environments, computer interaction and information architecture.

M. Urata is a research fellow in the Graduate School of International Development at Nagoya University, Japan. She received her Ph.D. from Nagoya University in the field of Informatics and Sciences. Her research interests focus on e-community, i-society and web-based learning.

K. Mouri is a curator of the Planetarium in Nagoya City Science Museum. He received his Ph.D. from Nagoya University in the field of Informatics and Sciences. His education includes a BS and an MS in Earth and Planetary Science from Nagoya University, Nagoya, Japan, in 1986 and 1988. His business experience includes work as a computer system engineer at NTT DATA Corp. from 1988 to 1990. His research interests include astronomy education, information systems and astronomy supporting systems.

T. Yasuda is a professor in the Graduate School of Information Sciences at Nagoya University. He received his BE and ME in electronic engineering from Mie University in 1982 and 1984, and his Ph.D. in information engineering from Nagoya University in 1989. His research interests focus on a well-designed real society with Information Communication Technology (ICT) and Design. His research involves inventing and applying new tools for the analysis and development of web services.