Knowledge-Based Systems 23(2010)32 Contents lists available at Science Direct Knowledge-Based Systems ELSEVIER journalhomepagewww.elsevier.com/locate/knosys Dealing with incomplete information in a fuzzy linguistic recommender system to disseminate information in university digital libraries C Porcel , E. Herrera-Viedma b University of Granada, Dep of Computer Science and Artificial Intelligence, Granada, Spain ARTICLE IN FO A BSTRACT As in the Web, the growing of information is the main problem of the academic digital libraries. Thus, ible online 5 August 2009 milar tools could be applied in university digital libraries to facilitate the information access by the stu- dents and teachers. In [46 we presented a fuzzy linguistic recommender system to advice research esources in university digital libraries. The problem of this system is that the user profiles are provide recommender systen directly by the own users and the process for acquiring user preferences is quite difficult because it niversity digital libraries ncomplete fuzzy linguistic preference facilitates the acquisition of the user preferences to characterize the user profiles. We allow users to provide their preferences by means of incomplete fuzzy linguistic preference relation. We include tools o manage incomplete information when the users express their preferences, and, in such a way, we show that the acquisition of the user profiles is improved. e 2009 Elsevier B V. All rights reserved. 1 Introduction Recommender systems are becoming popular tools for reducing information overload and to improve the sales in e-commerce web Digital libraries are information collections that have associated sites [ 7, 9, 35, 40, 49. The use of this kind of systems allows to re services delivered to user communities using a variety of technol- ommend resources interesting for the users, at the same time that ogies[8, 15.48. Therefore, digital libraries are the logical exten- these resources are inserted into the system. In the UDL frame sions of physical libraries in the electronic information society. work, recommender systems [7, 49] can be used to help users These extensions amplify existing resources and services. As such, (teachers, students and library staff) to find out and select their digital libraries offer new levels of access to broader audiences of information and knowledge sources[43 sers and new opportunities for the library In practice, a digital li- Generally, in a recommender system the users'information brary makes its contents and services remotely accessible through preferences can be used to define user profiles that are applied networks such as the Web or limited-access intranets 39, 50]. as filters to streams of documents[7, 47,49 In 45, 46 we devel- As digital libraries become commonplace and as their contents oped some recommender systems in an academic context. For in- become more varied, the users expect more sophisticated services stance in [45 we proposed a fuzzy linguistic recommender om them[8, 15,48, 50 A service that is particularly important is system for a technology transfer office which helps researchers the selective dissemination of information or filtering, to help the and environment companies allowing them to obtain information users to access interesting information for them. Users develop automatically about research resources(calls or projects)in their interest profiles and as new materials(books, papers, reports, interest areas; in [46 we proposed a fuzzy linguistic recommender and so on)are added to the collection, they are compared to the system to achieve major advances in the activities of UDl, which profiles and relevant items are sent to the users [39 recommends researchers specialize Moreover, digital libraries have been applied in a lot of contexts tary resources related with their respective research areas. The ut in this paper we focus on an academic environment. University problem of both recommender systems is that users must directly Digital Libraries(UDL) provide information resources and services specify their user profiles by providing their preferences on all top to students, faculty and staff in an environment that supports ics of interest and it requires too much user effort. learning, teaching and research [11 In th s paper, we focus on the idea of that tem could be seen as a decision support system(DSS)[37, 38, 44 g author. where the solution alternatives are the digital resources inserted -mail cporceleujaenes (C. Porcel) viedmaedecsai ugres into the library, and the criteria to satisfy are the user profiles The proper use of these recommendation systems is essential to 0950-7051/s-see front matter a 2009 Elsevier B v. All rights reserved oi:10.1016 knosys.200907.007
Dealing with incomplete information in a fuzzy linguistic recommender system to disseminate information in university digital libraries C. Porcel a,*, E. Herrera-Viedma b aUniversity of Jaén, Department of Computer Science, Jaén, Spain bUniversity of Granada, Department of Computer Science and Artificial Intelligence, Granada, Spain article info Article history: Available online 5 August 2009 Keywords: Recommender systems Fuzzy linguistic modeling University digital libraries Incomplete fuzzy linguistic preference relation abstract As in the Web, the growing of information is the main problem of the academic digital libraries. Thus, similar tools could be applied in university digital libraries to facilitate the information access by the students and teachers. In [46] we presented a fuzzy linguistic recommender system to advice research resources in university digital libraries. The problem of this system is that the user profiles are provided directly by the own users and the process for acquiring user preferences is quite difficult because it requires too much user effort. In this paper we present a new fuzzy linguistic recommender system that facilitates the acquisition of the user preferences to characterize the user profiles. We allow users to provide their preferences by means of incomplete fuzzy linguistic preference relation. We include tools to manage incomplete information when the users express their preferences, and, in such a way, we show that the acquisition of the user profiles is improved. 2009 Elsevier B.V. All rights reserved. 1. Introduction Digital libraries are information collections that have associated services delivered to user communities using a variety of technologies [8,15,48]. Therefore, digital libraries are the logical extensions of physical libraries in the electronic information society. These extensions amplify existing resources and services. As such, digital libraries offer new levels of access to broader audiences of users and new opportunities for the library. In practice, a digital library makes its contents and services remotely accessible through networks such as the Web or limited-access intranets [39,50]. As digital libraries become commonplace and as their contents become more varied, the users expect more sophisticated services from them [8,15,48,50]. A service that is particularly important is the selective dissemination of information or filtering, to help the users to access interesting information for them. Users develop interest profiles and as new materials (books, papers, reports, and so on) are added to the collection, they are compared to the profiles and relevant items are sent to the users [39]. Moreover, digital libraries have been applied in a lot of contexts but in this paper we focus on an academic environment. University Digital Libraries (UDL) provide information resources and services to students, faculty and staff in an environment that supports learning, teaching and research [11]. Recommender systems are becoming popular tools for reducing information overload and to improve the sales in e-commerce web sites [7,9,35,40,49]. The use of this kind of systems allows to recommend resources interesting for the users, at the same time that these resources are inserted into the system. In the UDL framework, recommender systems [7,49] can be used to help users (teachers, students and library staff) to find out and select their information and knowledge sources [43]. Generally, in a recommender system the users’ information preferences can be used to define user profiles that are applied as filters to streams of documents [7,47,49]. In [45,46] we developed some recommender systems in an academic context. For instance, in [45] we proposed a fuzzy linguistic recommender system for a technology transfer office which helps researchers and environment companies allowing them to obtain information automatically about research resources (calls or projects) in their interest areas; in [46] we proposed a fuzzy linguistic recommender system to achieve major advances in the activities of UDL, which recommends researchers specialized resources and complementary resources related with their respective research areas. The problem of both recommender systems is that users must directly specify their user profiles by providing their preferences on all topics of interest and it requires too much user effort. In this paper, we focus on the idea of that a recommender system could be seen as a decision support system (DSS) [37,38,44], where the solution alternatives are the digital resources inserted into the library, and the criteria to satisfy are the user profiles. The proper use of these recommendation systems is essential to 0950-7051/$ - see front matter 2009 Elsevier B.V. All rights reserved. doi:10.1016/j.knosys.2009.07.007 * Corresponding author. E-mail addresses: cporcel@ujaen.es (C. Porcel), viedma@decsai.ugr.es (E. Herrera-Viedma). Knowledge-Based Systems 23 (2010) 32–39 Contents lists available at ScienceDirect Knowledge-Based Systems journal homepage: www.elsevier.com/locate/knosys
provide real personalized services, and it can substantially reduce A variety of techniques have been proposed as the basis for I formation overload and increase user satisfaction. Therefore, it ommender systems [7, 17, 40, 49: all of these techniques have ben- has become an important area in information systems and decision efits and disadvantages. The use of an hybrid approach is proposed upport research[37, 38, 44 So, the activity of a recommender sys- to smooth out the disadvantages of each one of them and to exploit tem can be seen as a group decision making(GDM)problem, so we their benefits [5, 13, 16]. In these kind of systems, the users'infor- can adopt the typical representation formats used in GDM, as for mation preferences can be used to define user profiles that are ap- example, fuzzy preference relations [19, 20, 28, 32, 41 This repre- plied as filters to streams of documents. The construction of sentation format presents a high expressivity and some interesting accurate profiles is a key task and the system s success will depend properties that allow us to work easily. However, in real world on a large extent on the ability of the learned profiles to represent problems it is common to find situations in which users are not the user preferences [47] able to provide all the preference values that are required, and The recommendation activity is followed by a relevance hen, we have to deal with incomplete fuzzy preference relations back phase. Relevance feedback is a cyclic process whereby the 1-3.2526,41 users feed back into the system decisions on the relevance of re- The aim of this paper is to present a new fuzzy linguistic recom- trieved documents and the system uses these evaluations to auto- mender defined in a UDL framework which overcomes the problem matically update the user profiles [17, 49. of user profile characterization observed in the recommender sys tems defined in[45, 46]. In order to improve the system perfor- 2.2. The 2-tuple fuzzy linguistic approach mance, we propose an alternative way to obtain accurate and useful knowledge about the user preferences. This new recom- The fuzzy linguistic modeling(FLM)is a tool based on the con- mender system allows users to provide their preferences by means cept of linguistic variable[52] which has given very good results for of incomplete fuzzy linguistic preference relations [1. and in such modeling qualitative information in many problems, e.g., in deci- a way, we facilitate users the expression of their preferences and, sion making 20], quality evaluation 33, 34, models of information consequently, the determination of user profiles process. The rec retrieval 23, 24, 29-31, political analysis [4. etc. ommender system is able to complete the incomplete preference The 2-tuple FLM [21 is a continuous model of representation relations using the tools proposed in [1, 2. 26. Each user profile is of information that allows to reduce the loss of information typi composed of both user preferences on topics of interest and user cal of other fuzzy linguistic approaches(classical and ordinal preferences on collaboration possibilities with other users. Then, [18,)). the recommender system is able to recommend both research re Lets=iso,., Sg be a linguistic term set with odd cardinali sources and collaboration possibilities to the users of a UDL. As where the mid term represents a indifference value and the rest of in[45, 46] we define this recommender system in a multi-granular the terms are symmetrically related to it. We assume that the fuzzy linguistic context [10, 12, 22, 27, 32, 42 In such a way, we semantics of labels is given by means of triangular membership incorporate in the recommender system flexible tools to handle functions and consider all terms distributed on a scale on which the information by allowing to represent the different concepts a total order is defined, S <S,<i<j. In this fuzzy linguisti of the system with different linguistic label sets. context, if a symbolic method [18, 20 aggregating linguistic infor- The rest of the paper is set out as follows. Section 2 presents the mation obtains a value BE[0, g], and B#10,.,g, then an preliminaries necessary to develop the proposed model. Section 3 approximation function is used to express the result in S B is rep- presents the new recommender system to the dissemination of resented by means of 2-tuples (Si, a4), S E Sand a E[-5,5)where knowledge in a UDL Section 4 reports the system evaluation and 5, represents the linguistic label of the information, and o, is a the experimental results. Finally, our conclusions are pointed out numerical value expressing the value of the translation from the in Section 5 original result B to the closest index label, i, in the linguistic term set(S S). This 2-tuple representation model defines a set of transformation functions between numeric values and 2-tuple △(B)=(S1,x)and△-l(s,x)=B∈0g][21 The computational model is defined by presenting a negation 2.1. Recommender systems operator, comparison of 2-tuples and aggregation operators [21]. Using functions A and a"that transform without loss of informa- Recommender systems could be defined as systems that pre tion numerical values into linguistic 2-tuples and viceversa, any of duce individualized recommendations as output, or have the effect the existing aggregation operators can be easily extended for of guiding the user in a personalized way to interesting or useful dealing with linguistic 2-tuples. Some examples are objects in a large space of possible options [6]. It is a research area that offers tools for discriminating between relevant and irrelevant information by providing personalized assistance for continuous information accesses [43, 49 Automatic filtering services differ from retrieval services 23, 24, 29-31]in that in filtering the corpus changes continuously, the users have long time information needs(described by means of user profiles)in- stead of introducing a query into the system, and their objective is to remove irrelevant data from incoming streams of data items Definition 2(Weighted an Let x=i(r, a1) [17, 39, 49]. A result from a recommender system is understood as (rn, an)i be a set of linguistic d W=wI hts. the a recommendation, an option worthy of consideration, while a re- ghted average Xw is sult from an information retrieval system is interpreted as a match to the users query 7]. However both systems present some anal ogies, and in this sense they could be considered a DSS [44]. In both x"[(1, a1),.,(rm,an)]=A 14(r,a)W cases. the solution alternatives would be the documents to recom- mend or retrieve and the criteria to satisfy would be the user pro- files and user queries, respectively
provide real personalized services, and it can substantially reduce information overload and increase user satisfaction. Therefore, it has become an important area in information systems and decision support research [37,38,44]. So, the activity of a recommender system can be seen as a group decision making (GDM) problem, so we can adopt the typical representation formats used in GDM, as for example, fuzzy preference relations [19,20,28,32,41]. This representation format presents a high expressivity and some interesting properties that allow us to work easily. However, in real world problems it is common to find situations in which users are not able to provide all the preference values that are required, and then, we have to deal with incomplete fuzzy preference relations [1–3,25,26,41]. The aim of this paper is to present a new fuzzy linguistic recommender defined in a UDL framework which overcomes the problem of user profile characterization observed in the recommender systems defined in [45,46]. In order to improve the system performance, we propose an alternative way to obtain accurate and useful knowledge about the user preferences. This new recommender system allows users to provide their preferences by means of incomplete fuzzy linguistic preference relations [1], and in such a way, we facilitate users the expression of their preferences and, consequently, the determination of user profiles process. The recommender system is able to complete the incomplete preference relations using the tools proposed in [1,2,26]. Each user profile is composed of both user preferences on topics of interest and user preferences on collaboration possibilities with other users. Then, the recommender system is able to recommend both research resources and collaboration possibilities to the users of a UDL. As in [45,46] we define this recommender system in a multi-granular fuzzy linguistic context [10,12,22,27,32,42]. In such a way, we incorporate in the recommender system flexible tools to handle the information by allowing to represent the different concepts of the system with different linguistic label sets. The rest of the paper is set out as follows. Section 2 presents the preliminaries necessary to develop the proposed model. Section 3 presents the new recommender system to the dissemination of knowledge in a UDL. Section 4 reports the system evaluation and the experimental results. Finally, our conclusions are pointed out in Section 5. 2. Preliminaries 2.1. Recommender systems Recommender systems could be defined as systems that produce individualized recommendations as output, or have the effect of guiding the user in a personalized way to interesting or useful objects in a large space of possible options [6]. It is a research area that offers tools for discriminating between relevant and irrelevant information by providing personalized assistance for continuous information accesses [43,49]. Automatic filtering services differ from retrieval services [23,24,29–31] in that in filtering the corpus changes continuously, the users have long time information needs (described by means of user profiles) instead of introducing a query into the system, and their objective is to remove irrelevant data from incoming streams of data items [17,39,49]. A result from a recommender system is understood as a recommendation, an option worthy of consideration, while a result from an information retrieval system is interpreted as a match to the user’s query [7]. However both systems present some analogies, and in this sense they could be considered a DSS [44]. In both cases, the solution alternatives would be the documents to recommend or retrieve and the criteria to satisfy would be the user pro- files and user queries, respectively. A variety of techniques have been proposed as the basis for recommender systems [7,17,40,49]; all of these techniques have benefits and disadvantages. The use of an hybrid approach is proposed to smooth out the disadvantages of each one of them and to exploit their benefits [5,13,16]. In these kind of systems, the users’ information preferences can be used to define user profiles that are applied as filters to streams of documents. The construction of accurate profiles is a key task and the system’s success will depend on a large extent on the ability of the learned profiles to represent the user preferences [47]. The recommendation activity is followed by a relevance feedback phase. Relevance feedback is a cyclic process whereby the users feed back into the system decisions on the relevance of retrieved documents and the system uses these evaluations to automatically update the user profiles [17,49]. 2.2. The 2-tuple fuzzy linguistic approach The fuzzy linguistic modeling (FLM) is a tool based on the concept of linguistic variable[52] which has given very good results for modeling qualitative information in many problems, e.g., in decision making [20], quality evaluation [33,34], models of information retrieval [23,24,29–31], political analysis [4], etc. The 2-tuple FLM [21] is a continuous model of representation of information that allows to reduce the loss of information typical of other fuzzy linguistic approaches (classical and ordinal [18,52]). Let S ¼ fs0; ... ; sg g be a linguistic term set with odd cardinality, where the mid term represents a indifference value and the rest of the terms are symmetrically related to it. We assume that the semantics of labels is given by means of triangular membership functions and consider all terms distributed on a scale on which a total order is defined, si 6 sj () i 6 j. In this fuzzy linguistic context, if a symbolic method [18,20] aggregating linguistic information obtains a value b 2 ½0; g, and b R f0; ... ; gg; then an approximation function is used to express the result in S. b is represented by means of 2-tuples ðsi; aiÞ, si 2 S and ai 2 ½:5; :5Þ where si represents the linguistic label of the information, and ai is a numerical value expressing the value of the translation from the original result b to the closest index label, i, in the linguistic term set ðsi 2 SÞ. This 2-tuple representation model defines a set of transformation functions between numeric values and 2-tuples DðbÞ¼ðsi; aÞ and D1 ðsi; aÞ ¼ b 2 ½0; g [21]. The computational model is defined by presenting a negation operator, comparison of 2-tuples and aggregation operators [21]. Using functions D and D1 that transform without loss of information numerical values into linguistic 2-tuples and viceversa, any of the existing aggregation operators can be easily extended for dealing with linguistic 2-tuples. Some examples are Definition 1 (Arithmetic mean). Let x ¼ fðr1; a1Þ; ... ;ðrn; anÞg be a set of linguistic 2-tuples, the 2-tuple arithmetic mean xe is computed as, xe ½ ¼ ð Þ r1; a1 ; ... ; ð Þ rn; an D Xn i¼1 1 n D1 ri ð Þ ; ai ! ¼ D 1 n Xn i¼1 bi !: ð1Þ Definition 2 (Weighted average operator). Let x ¼ fðr1; a1Þ; ... ; ðrn; anÞg be a set of linguistic 2-tuples and W ¼ fw1; ... ; wng be their associated weights. The 2-tuple weighted average xw is xw½ ¼ ð Þ r1; a1 ; ... ; ð Þ rn; an D Pn i¼1D1 ri P ð Þ ; ai wi n i¼1wi ! ¼ D Pn i¼1bi P wi n i¼1wi : ð2Þ C. Porcel, E. Herrera-Viedma / Knowledge-Based Systems 23 (2010) 32–39 33
C Porcel, E. Herrera-Viedma/ Knowledge-Based Systems 23(2010)32-39 TF:t,n(t)→lt,n() Linguistic hierarchies. s n(D), ar())-(n(t2)-1 Level- TF(st, e)=n(o) l(2.13) As it was pointed out in[22 this family of transformation fund tions is bijective. This result guarantees that the transformations between levels of a linguistic hierarchy are carried out without loss 2.3. Incomplete fuzzy preference relations Definition 5. A fuzzy preference relation P on a set of alternatives X=x1, ...,xn) is a fuzzy set on the product set X xX, i.e., it is characterized by a mem nction Ap:XxX→0,1] When cardinality of X is small, the preference relation may be conveniently represented by the n x n matrix P=(Pig), being Py=Ap(xi, xy)(Vi, jE1,.,nh)interpreted as the preference degree or intensity of the alternative x over x, where . P= 1/2 indicates indifference between x and Py=1 indicates that x is absolutely preferred to x. and Py>1/2 indicates that x is preferred to x. Fig. 1. Linguistic hierarchy of 3, 5 and 9 labels. However, as we have mentioned, our syster rates the multi-granular FLM based on 2-tuples, so we must define a linguis- tic preference relation as follows: Definition 6. Let X=x1.,xn) a set of alternatives and sa Let x=i(r1, linguistic term set. A linguistic preference relation P= Pii, je weights. The pp:X×X→S×10.5.0.5), where Py=Ap(, x)is a 2-tuple which denotes the preference de- 对(,2,(wx)…(m)、%2=4/2B- gree of alternative x, regarding to x. As aforementioned, in many real world GDM problems the ex- (3) perts are often not able to provide all the preference values that are required In order to model these situations, we use incomplete with B,=A (n, a,)and Bw=A"(Wi, a) fuzzy preference relations [1-3, 25, 26, 41] nportant parameter to Definition 7. A function f: X-Y is partial when not every determine is the"granularity of uncertainty". ie, the cardinality element in the set x necessarily maps onto an element in the set of the linguistic term set S. when different experts have different Y. when every element from the set X maps onto one element of uncertainty degrees on the phenomenon or when an expert has the set Y, then we have a total function. to assess different concepts, then several linguistic term sets wit a different granularity of uncertainty are necessary [22, 32. In(22) Definition 8. A two-tuple fuzzy linguistic preference relation Pon a multi-granular 2-tuple FLM based on the concept of linguistic a set of alternatives X with a partial membership function is an hierarchy is proposed. incomplete two-tuple fuzzy linguistic preference relation A linguistic hierarchy(LH), is a set of levels l(t, n(t)), where each level t is a linguistic term set with different granularity n(t) from 3. A recommender system for the dissemination of information according to their granularity, i.e. a level t +1 provides a linguistic in UDL using incomplete linguistic preference relations refinement of the previous level t. We can define a level from its redecessor level as I(t, n(t))-I(t+1, 2. n(t)-1) Table 1 shows The UDL staff manage and spread many information about re- le granularity needed in each linguistic term set of the level t search resources such as electronic books, electronic papers, depending on the value n(t) defined in the first level (3 and 7. onic journals, official dailies and so on [8, 48. Nowadays, this amount of information is growing up and they are in need of auto- A graphical example of a linguistic hierarchy is shown in Fig. 1 mated tools to filter and spread that information to the users in a In [22] a family of transformation functions between labels simple and timely manner. On the other hand, UDL need from different levels was introduced tools to help them to insert their preferences to form accurate ofi Definition 4. Let LH=U(t, n(t)) be a linguistic hierarchy whose In this section we present a new fuzzy linguistic recommender i-1. The system in which the user transformation function between a 2-tuple that belongs to level t ences represented by inco fuzzy linguistic preference rela- and another 2-tuple in level t/t is defined as tions [1. This proposal es with some advantages with
Definition 3 (Linguistic weighted average operator). Let x ¼ fðr1; a1Þ; ... ;ðrn; anÞg be a set of linguistic 2-tuples and W ¼ fðw1; aw 1 Þ; ... ; ðwn; aw n Þg be their linguistic 2-tuple associated weights. The 2-tuple linguistic weighted average xw l is xw l ð Þ r1; a1 ; w1; aw 1 ... ð Þ rn; an ; wn; aw n ¼ D Pn i¼1bi P bWi n i¼1bWi !; ð3Þ with bi ¼ D1 ðri; aiÞ and bWi ¼ D1 ðwi; aw i Þ. In any fuzzy linguistic approach, an important parameter to determine is the ‘‘granularity of uncertainty”, i.e., the cardinality of the linguistic term set S. When different experts have different uncertainty degrees on the phenomenon or when an expert has to assess different concepts, then several linguistic term sets with a different granularity of uncertainty are necessary [22,32]. In [22] a multi-granular 2-tuple FLM based on the concept of linguistic hierarchy is proposed. A linguistic hierarchy (LH), is a set of levels lðt; nðtÞÞ, where each level t is a linguistic term set with different granularity nðtÞ from the remaining of levels of the hierarchy. The levels are ordered according to their granularity, i.e., a level t þ 1 provides a linguistic refinement of the previous level t. We can define a level from its predecessor level as lðt; nðtÞÞ ! lðt þ 1; 2 nðtÞ 1Þ. Table 1 shows the granularity needed in each linguistic term set of the level t depending on the value n(t) defined in the first level (3 and 7, respectively). A graphical example of a linguistic hierarchy is shown in Fig. 1. In [22] a family of transformation functions between labels from different levels was introduced: Definition 4. Let LH ¼ S tlðt; nðtÞÞ be a linguistic hierarchy whose linguistic term sets are denoted as SnðtÞ ¼ fs nðtÞ 0 ; ... ; s nðtÞ nðtÞ1g. The transformation function between a 2-tuple that belongs to level t and another 2-tuple in level t0–t is defined as TFt t0 : l tð Þ! ; nðtÞ l t0 ; nðt 0 ð ÞÞ ; TFt t0 s nðtÞ i ; anðtÞ ¼ D D1 s nðtÞ i ; anðtÞ nðt 0 ð Þ Þ 1 nðtÞ 1 0 @ 1 A: As it was pointed out in [22] this family of transformation functions is bijective. This result guarantees that the transformations between levels of a linguistic hierarchy are carried out without loss of information. 2.3. Incomplete fuzzy preference relations Definition 5. A fuzzy preference relation P on a set of alternatives X ¼ fx1; ... ; xng is a fuzzy set on the product set X X, i.e., it is characterized by a membership function lP : X X ! ½0; 1: When cardinality of X is small, the preference relation may be conveniently represented by the n n matrix P ¼ ðpijÞ, being pij ¼ lPðxi; xjÞ ð8i; j 2 f1; ... ; ngÞ interpreted as the preference degree or intensity of the alternative xi over xj, where pij ¼ 1=2 indicates indifference between xi and xj, pij ¼ 1 indicates that xi is absolutely preferred to xj, and pij > 1=2 indicates that xi is preferred to xj. However, as we have mentioned, our system integrates the multi-granular FLM based on 2-tuples, so we must define a linguistic preference relation as follows: Definition 6. Let X ¼ fx1 ... ; xng a set of alternatives and S a linguistic term set. A linguistic preference relation P ¼ pijð8i; j 2 f1; ... ; ngÞ on X is lP : X X ! S ½ Þ 0:5; 0:5 ; ð4Þ where pij ¼ lPðxi; xjÞ is a 2-tuple which denotes the preference degree of alternative xi regarding to xj. As aforementioned, in many real world GDM problems the experts are often not able to provide all the preference values that are required. In order to model these situations, we use incomplete fuzzy preference relations [1–3,25,26,41]. Definition 7. A function f : X ! Y is partial when not every element in the set X necessarily maps onto an element in the set Y. When every element from the set X maps onto one element of the set Y, then we have a total function. Definition 8. A two-tuple fuzzy linguistic preference relation P on a set of alternatives X with a partial membership function is an incomplete two-tuple fuzzy linguistic preference relation. 3. A recommender system for the dissemination of information in UDL using incomplete linguistic preference relations The UDL staff manage and spread many information about research resources such as electronic books, electronic papers, electronic journals, official dailies and so on [8,48]. Nowadays, this amount of information is growing up and they are in need of automated tools to filter and spread that information to the users in a simple and timely manner. On the other hand, UDL users need tools to help them to insert their preferences to form accurate profiles. In this section we present a new fuzzy linguistic recommender system in which the user profiles are obtained from user preferences represented by incomplete fuzzy linguistic preference relations [1]. This proposal contributes with some advantages with Table 1 Linguistic hierarchies. Level 1 Level 2 Level 3 l(t,n(t)) l(1,3) l(2,5) l(3,9) l(t,n(t)) l(1,7) l(2,13) Fig. 1. Linguistic hierarchy of 3, 5 and 9 labels. 34 C. Porcel, E. Herrera-Viedma / Knowledge-Based Systems 23 (2010) 32–39
C Porcel, E. Herrera-Viedma/ Knowledge-Based Systems 23(2010)32-39 Resources presentations 以)<~uo-cy} Recom Fig. 2. Operating scheme. regard to previous systems [46, 45 because it facilitates the 3. 1. Resources representation expression of their preferences to the users and reduces the user The considered resources are journal articles, conference contri- effort to characterize their user profiles. It is applied ise butions, book chapters, books or edited books. Once the library UDL users on the best research resources that could satisfy their staff insert all the available information about a new resource information needs in UDL Moreover, the system recommends col- the system obtains an internal representation mainly based in laboration possibilities to meet another researchers of related the resource scope. We use the vector model [36] to represent the areas which could collaborate with them in projects or interest resource scope. Thus, to sent a resource i we use a classifica- works. In such a way, this new recommender system improves tion composed by 25 disciplines (see Fig 3). In each position we the services that a UDL provides to the users, because it is easier store a linguistic 2-tuple value representing the importance degree the time cost to establish the user profile d it allows to decrease of the resource scope with respect to the discipline represented by to obtain the knowledge about the users an that position In Fig. 2 we can see the basic operating scheme, which is ex- plained in the following subsections WR=(WR1,WR2,…,VR25). Then, each component VRy E SI, withj=(1,., 25]. indicates the linguistic importance degree of the discipline j with regard to In the proposed system, the user-system communication is car- the resource i. These importance degrees are assigned by the ed out by using a multi-granular fuzzy linguistic approach library staff when they add a new resource. 22, 32, in order to allow a higher flexibility in the communication processes of the system. The system uses different label sets (S1, S2,.)to represent the different concepts to be assessed in 3. 1.2. User profiles its filtering activity. These label sets, Si, are chosen from those label The user profiles are composed of two kinds of user prefe sets that compose a lh, i. e, S E LH. We should point out that the number of different label sets that we can use is limited by the (1)User preferences on topics of interest, and umber of levels of LH, and therefore, in many cases the label set (2)User preferences on collaboration possibility with other Si and si can be associated to a same label set of LH but with diffe users take into account the follow g on the concept to be modeled.We lowing concepts that can be assessed in the syste <s The main contribution of this proposal is how users provide eir preferences on topics of interest used to represent the source resources. In previous proposals [45, 46] we represented such user Importance degree of a discipline with respect to a resource preferences using the vector model [36). The problem is that the user preferences(S1). users must insert or edit all the features corresponding to the dis Relevance degree of a resource for a user(S2). ciplines, i. e, in our case 25 categories. Thus, in previous proposal Compatibility degree between two users(S3). we worked with vectors composed of 25 positions(each one corre- Preference degree of a resource regarding another one(S4) sponding to a discipline), but there could exist cases in which this number could be greater. In such a way, users have to perform a To reduce this effort and make the process for ac pics of interest. Following the linguistic hierarchy shown in Fig. 1, in our system great effort to provide their preferences about to we use the level 2(5 labels) to assign importance and egrees(S=S and S4=S), and the level 3(9 labels) eferences easier, in this model we propose an alternative metho levance and compatibility degrees(S2=S and S3 to obtain the user prefe this lh, the linguistic terms in each level are We ask users to provide their preferences on some research re- sources, usually a limited number of resources, four or five. The .S=bo=Null=N, b,=Low=L, b2=Medium =M, b3=High= choice of research resources is made by the personal staff tanking H, b4=Total= TI into account the relevance supplied by the users. As in 41 we pro- S=(co=Null=N, C1= Very_Low=VL, C2= Low=L, C3= pose users to represent their preferences by means of incomplete More_-Low= MLL, C4= Medium= M, Cs== More-Less_ High fuzzy linguistic preference relations. Then, the system presents MLH. C6= High= H, c,=Very -High= VH, cs=Total= TI users only a selection of the most representative resources, and
regard to previous systems [46,45] because it facilitates the expression of their preferences to the users and reduces the user effort to characterize their user profiles. It is applied to advise UDL users on the best research resources that could satisfy their information needs in UDL. Moreover, the system recommends collaboration possibilities to meet another researchers of related areas which could collaborate with them in projects or interest works. In such a way, this new recommender system improves the services that a UDL provides to the users, because it is easier to obtain the knowledge about the users and it allows to decrease the time cost to establish the user profiles. In Fig. 2 we can see the basic operating scheme, which is explained in the following subsections. 3.1. Information representation In the proposed system, the user-system communication is carried out by using a multi-granular fuzzy linguistic approach [22,32], in order to allow a higher flexibility in the communication processes of the system. The system uses different label sets ðS1; S2; ...Þ to represent the different concepts to be assessed in its filtering activity. These label sets, Si, are chosen from those label sets that compose a LH, i.e., Si 2 LH. We should point out that the number of different label sets that we can use is limited by the number of levels of LH, and therefore, in many cases the label sets Si and Sj can be associated to a same label set of LH but with different interpretations, depending on the concept to be modeled. We take into account the following concepts that can be assessed in the system: Importance degree of a discipline with respect to a resource scope or user preferences (S1). Relevance degree of a resource for a user (S2). Compatibility degree between two users (S3). Preference degree of a resource regarding another one (S4). Following the linguistic hierarchy shown in Fig. 1, in our system we use the level 2 (5 labels) to assign importance and preference degrees (S1 ¼ S5 and S4 ¼ S5 ), and the level 3 (9 labels) to assign relevance and compatibility degrees (S2 ¼ S9 and S3 ¼ S9 ). Using this LH, the linguistic terms in each level are S5 ¼ fb0 ¼ Null ¼ N; b1 ¼ Low ¼ L; b2 ¼ Medium ¼ M; b3 ¼ High ¼ H;b4 ¼ Total ¼ Tg S9 ¼ fc0 ¼ Null ¼ N; c1 ¼ Very Low ¼ VL; c2 ¼ Low ¼ L; c3 ¼ More Less Low ¼ MLL; c4 ¼ Medium ¼ M; c5 ¼¼ More Less High MLH; c6 ¼ High ¼ H; c7 ¼ Very High ¼ VH; c8 ¼ Total ¼ Tg 3.1.1. Resources representation The considered resources are journal articles, conference contributions, book chapters, books or edited books. Once the library staff insert all the available information about a new resource, the system obtains an internal representation mainly based in the resource scope. We use the vector model [36] to represent the resource scope. Thus, to represent a resource i, we use a classification composed by 25 disciplines (see Fig. 3). In each position we store a linguistic 2-tuple value representing the importance degree of the resource scope with respect to the discipline represented by that position: VRi ¼ ð Þ VRi1; VRi2; ... ; VRi25 : ð5Þ Then, each component VRij 2 S1, with j ¼ f1; ... ; 25g, indicates the linguistic importance degree of the discipline j with regard to the resource i. These importance degrees are assigned by the library staff when they add a new resource. 3.1.2. User profiles The user profiles are composed of two kinds of user preferences: (1) User preferences on topics of interest, and (2) User preferences on collaboration possibility with other users. The main contribution of this proposal is how users provide their preferences on topics of interest used to represent the source resources. In previous proposals [45,46] we represented such user preferences using the vector model [36]. The problem is that the users must insert or edit all the features corresponding to the disciplines, i.e., in our case 25 categories. Thus, in previous proposals we worked with vectors composed of 25 positions (each one corresponding to a discipline), but there could exist cases in which this number could be greater. In such a way, users have to perform a great effort to provide their preferences about topics of interest. To reduce this effort and make the process for acquiring the user preferences easier, in this model we propose an alternative method to obtain the user preferences on topics of interest. We ask users to provide their preferences on some research resources, usually a limited number of resources, four or five. The choice of research resources is made by the personal staff tanking into account the relevance supplied by the users. As in [41] we propose users to represent their preferences by means of incomplete fuzzy linguistic preference relations. Then, the system presents users only a selection of the most representative resources, and Users Resources Resource representations VRi Acquiring user's preferences Incomplete preference relation P Computing missing information P* Aggregation VUx Recommendations Matching process Fig. 2. Operating scheme. C. Porcel, E. Herrera-Viedma / Knowledge-Based Systems 23 (2010) 32–39 35
C Porcel, E. Herrera-Viedma/ Knowledge-Based Systems 23(2010)32-39 D Agmcuture, animal breeding and faning D vegetal and animal talog anut ecology a Boterhnnlooy, mol d celular biology and genetics v■Fnsi ull L Materials schane e and technotogy a social scienee NA心 mputers stlence and technology 口 Fnurey and comhushbig L Pharmacology and pharmacy dium a Histor and an ns. sonutrustlon and Null 口 medicins and veterinary D Environment and environmental technolo 口 Muttp-discroana NA口spy Null a Psychology and educaton sciences L Chemistny and chemistry tethnoidgr D Telecommunications, electric engineenng, electronics and Fig. 3. Interface to define the disciplines of the resource scope. the users provide their preferences about these resources by mea of an incomplete fuzzy preference relation. Furthermore, according i.e., user preference vector, firstly to results presented in [2. it is enough that the users provide only preference degrees on each consid a row of the preference relation. Then, we use the method pro- to the preference relation P, and posed in [2] to complete the relations. Once the system completes preference degrees together with the vectors that represent ach research resource to obtain the user preference vector. possible to obtain a vector representing the user preferences on The preference degrees coincides with the dominance the topics of interest. Next, we explain this process in detail degrees of a linguistic preference relation [19 To obtain them we propose the application of the arithmetic mean (1)Acquiring the user preferences on a limited number of x(Definition 1). Then, the preference degree of the research resources: At the beginning, the main goal is to resource i for the expert called DGi, is computed as follows: help the users to provide their preferences assuring that these preferences are as consistent as possible. The system DG=X Pi.,Ps] shows users the five most representative resources, R (r1., Ts), and asks them to express their preferences by VUx, .. VUxs) from the aggregation of the vectors that represents neans of an incomplete fuzzy linguistic preference relation the characteristics of the chosen research resources, i.e. ( VRI (see Fig. 4). The users only fill those preferences that they VRs weighted by means of the user preference degrees wish, assigning labels of S4. In the preference relation, each (DGI ... DGs). To do that, we use the linguistic weighted average preference value Py represents the linguistic preference operator defined in Definition 3, and then each position degree of resource i over the resource j according to the user k=(1,., 25)of the vector VUx, is computed as follows provide a relation with only one row of preference values: VUxk=X [(VRik, DG1),., (VRsk, DG5) P12P13P14P15 On the other hand, to complete the user profile, the system asks XXX every user to express his/her collaboration preferences, i. e, if he/ XX (6 ons on collaboration possibili- ties with others users. This could help users to develop multi-dis- ciplinar works or participate in collaborative research projects [46]. They should respond to this question with"Yes"or"No Then, the system completes the preference relation P using the od proposed in [2 and obtains the relation P- 3. 2. Recommendation strategy P12P13 In this phase the system generates the recommendations to de- P= P31 P32 P34 P3s (7) liver the information resources to the fitting users. This process is P41P42P43 based on a matching process developed between user profiles and P51 P52 P53 P54 resource representations [17, 36. To do that, we can use different kinds of similarity measures, such as euclidean distance or Cosine Measure. Particularly we use the standard cosine measure[36].As where Py E Sa are the degrees inserted by the user about the pref- the components of the vectors used to represent user profiles and ence. ant nguistic values, then we define the nd each p is the estimated degree for the user about his her cosine measure in a 2-tuple linguistic context. Given two vectors of nce of the resource x, with respect to x 2-tuple linguistic values
the users provide their preferences about these resources by means of an incomplete fuzzy preference relation. Furthermore, according to results presented in [2], it is enough that the users provide only a row of the preference relation. Then, we use the method proposed in [2] to complete the relations. Once the system completes the fuzzy linguistic preference relation provided by the user, it is possible to obtain a vector representing the user preferences on the topics of interest. Next, we explain this process in detail: (1) Acquiring the user preferences on a limited number of research resources: At the beginning, the main goal is to help the users to provide their preferences assuring that these preferences are as consistent as possible. The system shows users the five most representative resources, R ¼ fr1 ... ; r5g, and asks them to express their preferences by means of an incomplete fuzzy linguistic preference relation (see Fig. 4). The users only fill those preferences that they wish, assigning labels of S4. In the preference relation, each preference value pij represents the linguistic preference degree of resource i over the resource j according to the user feeling. As aforementioned, the simplest case would be to provide a relation with only one row of preference values: P ¼ p12 p13 p14 p15 x xxx x x x x xx x x xx x x 0 BBBBBB@ 1 CCCCCCA : ð6Þ Then, the system completes the preference relation P using the method proposed in [2], and obtains the relation P : P ¼ p12 p13 p14 p15 p 21 p 23 p 24 p 25 p 31 p 32 p 34 p 35 p 41 p 42 p 43 p 45 p 51 p 52 p 53 p 54 0 BBBBBB@ 1 CCCCCCA ; ð7Þ where p1j 2 S4 are the degrees inserted by the user about the preferences of the resource x1 with respect to xj, pii represents indifference, and each p ij is the estimated degree for the user about his/her preference of the resource xi with respect to xj. (2) In order to obtain user preferences on topic of interest, i.e., user preference vector, firstly we calculate the user preference degrees on each considered resource according to the preference relation P , and secondly, we use this preference degrees together with the vectors that represent each research resource to obtain the user preference vector. The preference degrees coincides with the dominance degrees of a linguistic preference relation [19]. To obtain them we propose the application of the arithmetic mean xe (Definition 1). Then, the preference degree of the resource i for the expert called DGi, is computed as follows: DGi ¼ xe p i1; ... ; p i5 : ð8Þ Then, to obtain the user preference vector x, i.e. VUx ¼ ðVUx1; VUx2; ... ; VUx25Þ, from the aggregation of the vectors that represents the characteristics of the chosen research resources, i.e., fVR1; ... ; VR5g, weighted by means of the user preference degrees fDG1; ... ; DG5g. To do that, we use the linguistic weighted average operator defined in Definition 3, and then each position k ¼ f1; ... ; 25g of the vector VUx, is computed as follows: VUxk ¼ xw l ½ ð Þ VR1k;DG1 ; ... ; ð Þ VR5k;DG5 : ð9Þ On the other hand, to complete the user profile, the system asks every user to express his/her collaboration preferences, i.e., if he/ she wants to receive recommendations on collaboration possibilities with others users. This could help users to develop multi-disciplinar works or participate in collaborative research projects [46]. They should respond to this question with ‘‘Yes” or ‘‘No”. 3.2. Recommendation strategy In this phase the system generates the recommendations to deliver the information resources to the fitting users. This process is based on a matching process developed between user profiles and resource representations [17,36]. To do that, we can use different kinds of similarity measures, such as euclidean distance or Cosine Measure. Particularly, we use the standard cosine measure [36]. As the components of the vectors used to represent user profiles and research resources are 2-tuple linguistic values, then we define the cosine measure in a 2-tuple linguistic context. Given two vectors of 2-tuple linguistic values: Fig. 3. Interface to define the disciplines of the resource scope. 36 C. Porcel, E. Herrera-Viedma / Knowledge-Based Systems 23 (2010) 32–39
C Porcel, E. Herrera-Viedma/ Knowledge-Based Systems 23(2010)32-39 Null Nu Fig. 4. Interface to define the user preferences. V1=(v1,.xn1),(v12,av12),…,(v125,av125) Contingency table for the resources. V2=(V21,a21),(v2,xv2),……,(v25v25) then the linguistic similarity between both, called oI(VI, V2)ESI Irrelevant G(V1,V2) 1(△(wk,xk)×△(2x, it fulfills the proposed innovations, that is, the recommended infor- mation is useful and interesting for the users, reducing the effort x2()y2(-a) and making easier the process for acquiring the users preferences. Now we have implemented a trial version, in which the system works only with a few researchers where g is the granularity of S, and(vik, avik)is the 2-tuple linguistic value of term k in the vector(vi) 4.1. Evaluation metrics When a new resource i is inserted into the system, we calculate the linguistic similarity measures, aI(VR. VU). between the repre- In the scope of recommender systems, precision, recall and F1 entation vector of this new resource(VR) and all the user prefer e measures widely used to evaluate the quality of the recom- nce vectors, VU VUm), where m is the number of users in mendations [9, 14,, 51]. We use them to compare the new pro- the system. These user preference vectors are obtained as we ha posal with previous systems. To calculate th tic threshold value(v)to filter the output of the system. Next, the The G an n neea le to categorize the items with respect to the a contingency ta Then, if ar(VR, VU)>w, the user j is selected to receive recom- informatic The items are classified both as relevant or mendations about resource i. Previously, we have defined a linguis- irrelevant, and selected( recommended to the user)or not selected. ntingency table(see Table 2) is created using these fou system applies to each or(VR, VU) the transformation function de- categories. fined in Definition 4, to obtain the relevance degree of the resource Precision is defined as the ratio of the selected relevant items to i for the user j, expressed using a label of the set Sz the selected items, that is, it measures the probability of a selecte The collaboration preferences provided by the users are used to item be relevant collaborators W/N For the users of w/ n the system has finished the P=Nr recommendation process, and therefore it sends them the resource information together with its linguistic relevance degree. Recall is calculated as the ratio of the selected relevant items to For the users in wc the system calculates the collaboration pos- the relevant items, that is, it represents the probability of a rele sibilities. To do it, between each two users x,y E wc, the system vant items be selected performs the following step (1) Calculate the linguistic similarity measure between both users, G(VUx, VU,). Fl is a combination metric that gives equal weight to both 2)Obtain the linguistic compatibe egree between both cision and recall sers, which must be expressed in S3. To do that, we apply the transformation function defined in 4 on a(VUx, VUy 2×R1×P1 F1=R1 Finally the system sends to the users of wc the resource infor- mation, its calculated linguistic relevance degree and the collabo- 4.2. Experimental results degree The purpose of the experiment is to test the performance of the proposed system, so we compared the recommendations made by In this section we present the evaluation of the proposed sys- system assessing the relevance of the r e the library staff.When 4. Experiment and evaluation the system with the information provided by t ommended resource. i.e tem. The main focus in evaluating the system is to determinate if they provide their opinions about the recommendation supplied
V1 ¼ ð Þ ð Þ v11; av11 ;ð Þ v12; av12 ; ... ; ð Þ v125; av125 ; and V2 ¼ ð Þ ð Þ v21; av21 ;ð Þ v22; av22 ; ... ; ð Þ v225; av225 ; then the linguistic similarity between both, called rlðV1; V2Þ 2 S1 is defined as rlð Þ V1; V2 ¼ D g P25 k¼1 D1 ð Þ v1k; av1k D1 ð Þ v2k; av2k ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi P25 k¼1 D1 ð Þ v1k; av1k 2 r ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi P25 k¼1 D1 ð Þ v2k; av2k 2 r 0 BB@ 1 CCA; ð10Þ where g is the granularity of S1 and ðvik; avikÞ is the 2-tuple linguistic value of term k in the vectorðViÞ. When a new resource i is inserted into the system, we calculate the linguistic similarity measures, rlðVRi; VUjÞ, between the representation vector of this new resource ðVRiÞ and all the user preference vectors, fVU1; ... ; VUmg; where m is the number of users in the system. These user preference vectors are obtained as we have indicated in Section 3.1.2. Then, if rlðVRi; VUjÞ P w, the user j is selected to receive recommendations about resource i. Previously, we have defined a linguistic threshold value ðwÞ to filter the output of the system. Next, the system applies to each rlðVRi; VUjÞ the transformation function de- fined in Definition 4, to obtain the relevance degree of the resource i for the user j, expressed using a label of the set S2. The collaboration preferences provided by the users are used to classify the selected users in two sets, collaborators UC and noncollaborators UN. For the users of UN the system has finished the recommendation process, and therefore it sends them the resource information together with its linguistic relevance degree. For the users in UC the system calculates the collaboration possibilities. To do it, between each two users x; y 2 UC, the system performs the following steps: (1) Calculate the linguistic similarity measure between both users, rlðVUx; VUyÞ. (2) Obtain the linguistic compatibility degree between both users, which must be expressed in S3. To do that, we apply the transformation function defined in 4 on rlðVUx; VUyÞ. Finally the system sends to the users of UC the resource information, its calculated linguistic relevance degree and the collaboration possibilities characterized by its linguistic compatibility degrees. 4. Experiment and evaluation In this section we present the evaluation of the proposed system. The main focus in evaluating the system is to determinate if it fulfills the proposed innovations, that is, the recommended information is useful and interesting for the users, reducing the effort and making easier the process for acquiring the user’s preferences. Now we have implemented a trial version, in which the system works only with a few researchers. 4.1. Evaluation metrics In the scope of recommender systems, precision, recall and F1 are measures widely used to evaluate the quality of the recommendations [9,14,43,51]. We use them to compare the new proposal with previous systems. To calculate these metrics we need a contingency table to categorize the items with respect to the information needs. The items are classified both as relevant or irrelevant, and selected (recommended to the user) or not selected. The contingency table (see Table 2) is created using these four categories. Precision is defined as the ratio of the selected relevant items to the selected items, that is, it measures the probability of a selected item be relevant: P ¼ Nrs Ns : ð11Þ Recall is calculated as the ratio of the selected relevant items to the relevant items, that is, it represents the probability of a relevant items be selected: R ¼ Nrs Nr : ð12Þ F1 is a combination metric that gives equal weight to both precision and recall: F1 ¼ 2 R1 P1 R1 þ P1 : ð13Þ 4.2. Experimental results The purpose of the experiment is to test the performance of the proposed system, so we compared the recommendations made by the system with the information provided by the library staff. When the users receive a recommendation, they provide a feedback to the system assessing the relevance of the recommended resource, i.e., they provide their opinions about the recommendation supplied Fig. 4. Interface to define the user preferences. Table 2 Contingency table for the resources. Selected Not selected Total Relevant Nrs Nrn Nr Irrelevant Nis Nin Ni Total Ns Nn N C. Porcel, E. Herrera-Viedma / Knowledge-Based Systems 23 (2010) 32–39 37
C Porcel, E Herrera-Viedma/Knowledge-Based Systems 23(2010)32-39 by the system. If they are satisfied with the recommendation, they by the system with the recommendations provided by the library provide a higher value. We use that feedback information to evalu- staff and the relevance degrees inserted by the users. with this ate the system, applying the measures described in the previous information, we build the contingency table for the recommended resources. It is shown in table 3 We considered a data set with 30 research resources of different From this contingency table, we obtain areas, collected by the library staff from different information cision, recall and Fl which are shown in 4. The average of sources. These resources were included into the system following precision, recall and Fl metrics are 67.50%, 61.39% and 63.51%, the indications described in Section 3.1.1. Initially we limited these respectively. The Fig. 5 shows a graph with the precision, recall experiments to 6 users; all of them completed the registration pro- and F1 values for each user. These values reveal a good perfor cess and they inserted their preferences about the five most rele- mance of the proposed system, and therefore, a good user From this information provided by the users, the system builds satisfaction. vant resources presented by the system (like in Fig 4) the user profiles. These user profiles obtained from the provided preferences and the resources previously inserted, constituted 5 Conclusions our training data set. Then, we added 20 new resources that consti- tuted the test data set. The system filtered these 20 resources and Digital libraries can serve as powerful tools for universities to recommended each one to the suitable users. To obtain data to reach out and expand their sphere of influence in the society. compare, the 20 new resources also were recommended using UDL provides effective channels for the dissemination of research the advices of the library staff. information But users of udl need tools to assist them in their For example, for the user 1, the system selected 4 resources processes of information gathering because of the large amount relevant. However, from the information provided by the library of information available on these systems, as for example, recom- staff and the user feedback, we could see that the system selected mender systems 1431 1 irrelevant resource for user 1. and it didnt select 2 resources that We have proposed a multi-granular linguistic recommender library staff considered relevant for the user 1. Then, to build the system in this research topic 1451. However, the process for acqui contingency table, we compared the recommendations provided ing the user profiles requires great effort, and sometimes, it is com- plicated by the great quantity of information that the user has to give to characterize their feeling on topics of interest. In this paper we have proposed a new method to overcome this problem. Users Experimental contingency table. do not directly provide the user preference vectors that character ize their profiles. They provide preferences on some research re- User 1 User 3 User 4 User 5 User 6 sources and from this information we calculate their respective preference vectors on topics of interest. Furthermore, to facilitate 2154 21133 the process for acquiring the user preferences on the resources we allow users to provide their preferences by means of incom 4 plete fuzzy linguistic preference relations. The user profile is com- pleted with user preferences on the collaboration possibilities with other users. Therefore, this recommender system acts as a decision support system that makes decisions about both the resources that Table 4 could be interesting for a researcher and his/her collaboration pos Detailed experiment results for the recommendations. sibilities with other researchers to form interesting working groups. The experimental results show the user satisfaction with Precision(%) Recall(%) F1(‰) the received recommendations Acknowledgements 4000 72,73 User 6 75.00 7500 e This paper has been developed with the financing of FEDER ds in FUZZYLING project (TIN2007-61079). PETRI project (PET2007-0460), and project of Ministry of Public Works(90/07). 40.00% User 4 一 Precision- RecaH F1 Fig. 5. Experiment results
by the system. If they are satisfied with the recommendation, they provide a higher value. We use that feedback information to evaluate the system, applying the measures described in the previous section. We considered a data set with 30 research resources of different areas, collected by the library staff from different information sources. These resources were included into the system following the indications described in Section 3.1.1. Initially we limited these experiments to 6 users; all of them completed the registration process and they inserted their preferences about the five most relevant resources presented by the system (like in Fig. 4). From this information provided by the users, the system builds the user profiles. These user profiles obtained from the provided preferences and the resources previously inserted, constituted our training data set. Then, we added 20 new resources that constituted the test data set. The system filtered these 20 resources and recommended each one to the suitable users. To obtain data to compare, the 20 new resources also were recommended using the advices of the library staff. For example, for the user 1, the system selected 4 resources as relevant. However, from the information provided by the library staff and the user feedback, we could see that the system selected 1 irrelevant resource for user 1, and it didn’t select 2 resources that library staff considered relevant for the user 1. Then, to build the contingency table, we compared the recommendations provided by the system with the recommendations provided by the library staff and the relevance degrees inserted by the users. With this information, we build the contingency table for the recommended resources. It is shown in Table 3. From this contingency table, we obtain the corresponding precision, recall and F1 which are shown in Table 4. The average of precision, recall and F1 metrics are 67.50%, 61.39% and 63.51%, respectively. The Fig. 5 shows a graph with the precision, recall and F1 values for each user. These values reveal a good performance of the proposed system, and therefore, a good user satisfaction. 5. Conclusions Digital libraries can serve as powerful tools for universities to reach out and expand their sphere of influence in the society. UDL provides effective channels for the dissemination of research information. But users of UDL need tools to assist them in their processes of information gathering because of the large amount of information available on these systems, as for example, recommender systems [43]. We have proposed a multi-granular linguistic recommender system in this research topic [46]. However, the process for acquiring the user profiles requires great effort, and sometimes, it is complicated by the great quantity of information that the user has to give to characterize their feeling on topics of interest. In this paper we have proposed a new method to overcome this problem. Users do not directly provide the user preference vectors that characterize their profiles. They provide preferences on some research resources and from this information we calculate their respective preference vectors on topics of interest. Furthermore, to facilitate the process for acquiring the user preferences on the resources we allow users to provide their preferences by means of incomplete fuzzy linguistic preference relations. The user profile is completed with user preferences on the collaboration possibilities with other users. Therefore, this recommender system acts as a decision support system that makes decisions about both the resources that could be interesting for a researcher and his/her collaboration possibilities with other researchers to form interesting working groups. The experimental results show the user satisfaction with the received recommendations. Acknowledgements This paper has been developed with the financing of FEDER funds in FUZZYLING project (TIN2007-61079), PETRI project (PET2007-0460), and project of Ministry of Public Works (90/07). Table 4 Detailed experiment results for the recommendations. Precision (%) Recall (%) F1 (%) User 1 75,00 60,00 66,67 User 2 66,67 66,67 66,67 User 3 75,00 50,00 60,00 User 4 33,33 50,00 40,00 User 5 80,00 66,67 72,73 User 6 75,00 75,00 75,00 Average 67,50 61,39 63,51 Table 3 Experimental contingency table. User 1 User 2 User 3 User 4 User 5 User 6 Nrs3 2 3 1 4 3 Nrn2 1 3 1 2 1 Nis1 1 1 2 1 1 Nr5 3 6 2 6 4 Ns4 3 4 3 5 4 Fig. 5. Experiment results. 38 C. Porcel, E. Herrera-Viedma / Knowledge-Based Systems 23 (2010) 32–39
C Porcel, E. Herrera-Viedma/ Knowledge-Based Systems 23(2010)32-39 [27] E. Herrera-Viedma, O Cordon, M. Luque, A.G. Lopez, A.M. Munoz, A model F.J. Cabrerizo, F. Chiclana, F. Herrera, E. Herrer relations. [28]E. He [2]S. Alonso, F. Chiclana, F. Herrera, E. Herrera-ViedI C PorceL,A Systems Man and Cybernetics Part A: Systems and Humans 32 (3)(2002)39 International Journal of Intelligent Systems 23(2008)155- 29 edma, A.G. Lopez-Herrera, A model of information retrieval [3] S. Alonso, E. Herrera-Viedma, F. system with unbalanced fuzzy lin lligent Systems22(11)(2007)1197-1214 information, International Journal of International Journal of Information Technology and Decision Making 8(2) 130. Herrera- viedma, A.G. Lope2-Herrera, M Luque. C Porcel, A fuzzy linguistic aking in politics. A linguistic fuzzy-set approach [31]E. Herrera-Viedma, A.G. Lopez-Herrer dation. in: etrieval system, Intem :=2 6 tems, User Modeling and User-Adapted nteraction12(4)(2002)331-370. [32] E. Herrera-Viedma, L Martinez, F. Mata, F. Chicla 四是BMx2 system model for group decision-making problems with multi- nguistic preference relations, IEEE Transactions on Fuzzy Systems 13(5) (2005)644-658. NSF-EU DELOS Working Group Report, 2003. cel, Evaluating [9 Y. Cao, Y. Li, An intelligent fuzzy-based recommendation system fo and strategy-oriented aggregation approach on the assessment of supply 134/ E Herrology57(4)(2006)538-549 ing, Appl y linguistic era-viedma, E Peis, Evaluating the informative quality of documents in performance. European Journal of Operational Research 177(2)(2007)1013- computing with ent39(2)(2003)233-249 MH n devechoaon As tine ot liter a abrade& nora 35 er system for udents, Expert Systems with Applications 34(2 [36] R.R. Korfhage, Information Storage and Retrieval, Wiley Computer Publishing. [12] Z Chen, D Ben-Arieh, On the fusion of multi-granularity linguistic label sets in group decision making, Computers and Industrial Engineering 51(3)(2006 port: an editorial [13] M. Claypool, A. Gokhale, T. Miranda, Combining content-based [38]J. Malinowski, T. Weitzel, T. Keim, for team staffing. collaborative filters in an online newpaper, in: Proceedings of the ACM automated relational recommendation approach, Decision Support System orkshop on Recommender Systems-lmplementation and Evaluation, 1999 [39] G. Marchionini, Research and develop indigitallibrarieshttp:// I 14 CW. Cleverdon. E M. Keen, Factors determining the performance of indexing (40)L Martinez, M. Barranco, LG. Perez, M. Espinil edge based recommender system with granular 15 calves, E.A Fox, LT Watson, NA Kipp, Streams, structures, spaces, tional Journal of Computational Intelligence Systems 1(3)(2008) s, societies(5 s): a formal model for digital libraries, ACM Transactions -236. 4 knowledge based recommender systems using incomplete linguistic Riedl, Combining collaborative filtering with personal agents for bette Uncertainty, Fuzziness and wledge-Based Systems 16 (2)(2008)33-56 42 LG. Perez, M. Barranco, A multi-granular linguistic based-content [17 U. Hanani, B. Shapira, P. Shoval, Information filtering: overview of issues lation model, International Journal of Intelligent Systems 22(5) esearch and systems, User Modeling and User-Adapted Interaction 11(2001)[431 AW.Neumann, Recommender Systems for Information Pr 8]F nformation, IEEE Transactions on Systems Man and Cybernetics Part A141 Pasi, Modeling users界B要Hdb or information access [19 F. Herrera, E. Herrera-Viedma, Choice functions and mechanisms for linguist eference relations Applications 36(2009)5773-5182 inguistic modeling Expert Systems with [201E. Herrera, E. Herrera-Viedma, IL Verdegay, Direct approach processes in 146 C Porcel, J.M. Moreno, E Herrera-viedma A muliti-disciplinar recommender stems79(1996)175-190. B2s2mputing with words, IEEE Transation a inguistic represent asion model for 1471 LM. Quiroga, Mostafa An experiment in building profiles in information 四m gement38(2002)671-694 [22]L Martinez, F. Herrera, A model based cybernetics31(2)(2001)227-234. [49 P. Reisnick, HR Varian, Recommender systems, Special issue of Communications of the ACM 40(3)(1997)56-59. Society for Information Science and Technology 52(6)(2001)460-475 tal revolution, The journal of Academic [24] E. Herrera-Viedma, An information retrieval system with ordinal linguistic tianshi34(2)(2008)145-152. [51] B. Sarwar. J. Riedl, Analysis of recommenda iness and Ki algorithms for e-commerce. in: Proceedings of ACM model for group decision making with e fuzzy preference relations, IE [52]LA. Zadeh, The concept of a linguistic variable and its applications to [26] E. Herrera-Viedma, F Chiclana, F. Herrera, S. Alonso, Group decision making LA. Zadeh. The cond nodel with incomplete fuzzy preference relations based on additive approximate reasoning, Part ll, Information Sciences 8(1975)301-357 linguistic variable and its applications to Cybernetics37(1)(2007)176-189 approximate reasoning, Part Ill, Information Sciences 9(1975)43-80
References [1] S. Alonso, F.J. Cabrerizo, F. Chiclana, F. Herrera, E. Herrera-Viedma, Group decision-making with incomplete fuzzy linguistic preference relations, International Journal of Intelligent Systems 24 (2) (2009) 201–222. [2] S. Alonso, F. Chiclana, F. Herrera, E. Herrera-Viedma, J. Alcalá-Fdez, C. Porcel, A consistency-based procedure to estimating missing pairwise preference values, International Journal of Intelligent Systems 23 (2008) 155–175. [3] S. Alonso, E. Herrera-Viedma, F. Chiclana, F. Herrera, Individual and social strategies to deal with ignorance situations in multi-person decision making, International Journal of Information Technology and Decision Making 8 (2) (2009) 313–333. [4] B. Arfi, Fuzzy decision making in politics. A linguistic fuzzy-set approach (LFSA), Political Analysis 13 (1) (2005) 23–56. [5] C. Basu, H. Hirsh, W. Cohen, Recommendation as classification: using social and content-based information in recommendation, in: Proceedings of the Fifteenth National Conference on Artificial Intelligence, 1998, pp. 714–720. [6] R. Burke, Hybrid recommender systems, User Modeling and User-Adapted Interaction 12 (4) (2002) 331–370. [7] R. Burke, Hybrid web recommender systems, in: P. Brusilovsky, A. Kobsa, W. Nejdl, (Eds.), The Adaptive Web, LNCS 4321, 2007, pp. 377–408. [8] J. Callan et al., Personalisation and recommender systems in digital libraries, Joint NSF–EU DELOS Working Group Report, 2003. [9] Y. Cao, Y. Li, An intelligent fuzzy-based recommendation system for consumer electronic products, Expert Systems with Applications 33 (2007) 230–240. [10] S.L. Chang, R.C. Wang, S.Y. Wang, Applying a direct multigranularity linguistic and strategy-oriented aggregation approach on the assessment of supply performance, European Journal of Operational Research 177 (2) (2007) 1013– 1025. [11] H. Chao, Assessing the quality of academic libraries on the web: the development and testing of criteria, Library & Information Science Research 24 (2002) 169–194. [12] Z. Chen, D. Ben-Arieh, On the fusion of multi-granularity linguistic label sets in group decision making, Computers and Industrial Engineering 51 (3) (2006) 526–541. [13] M. Claypool, A. Gokhale, T. Miranda, Combining content-based and collaborative filters in an online newpaper, in: Proceedings of the ACM SIGIR-99 Workshop on Recommender Systems-Implementation and Evaluation, 1999. [14] C.W. Cleverdon, E.M. Keen, Factors determining the performance of indexing systems. Test results, vol. 2. ASLIB Cranfield Research Project, Cranfield, Bedford, England, 1966. [15] M.A. Gonçalves, E.A. Fox, L.T. Watson, N.A. Kipp, Streams, structures, spaces, scenarios, societies (5 s): a formal model for digital libraries, ACM Transactions on Information Systems 22 (2) (2004) 270–312. [16] N. Good, J.B. Schafer, J.A. Konstan, A. Borchers, B.M. Sarwar, J.L. Herlocker, J. Riedl, Combining collaborative filtering with personal agents for better recommendations, in: Proceedings of the Sixteenth National Conference on Artificial Intelligence, 1999, pp. 439–446. [17] U. Hanani, B. Shapira, P. Shoval, Information filtering: overview of issues, research and systems, User Modeling and User–Adapted Interaction 11 (2001) 203–259. [18] F. Herrera, E. Herrera-Viedma, Aggregation operators for linguistic weighted information, IEEE Transactions on Systems Man and Cybernetics Part A: Systems and Humans 27 (1997) 646–656. [19] F. Herrera, E. Herrera-Viedma, Choice functions and mechanisms for linguistic preference relations, European Journal of Operational Research 120 (2000) 144–161. [20] F. Herrera, E. Herrera-Viedma, J.L. Verdegay, Direct approach processes in group decision making using linguistic OWA operators, Fuzzy Sets and Systems 79 (1996) 175–190. [21] F. Herrera, L. Martı´nez, A 2-tuple fuzzy linguistic representation model for computing with words, IEEE Transactions on Fuzzy Systems 8 (6) (2000) 746– 752. [22] L. Martı´nez, F. Herrera, A model based on linguistic 2-tuples for dealing with multigranularity hierarchical linguistic contexts in multiexpert decisionmaking, IEEE Transactions on Systems, Man and Cybernetics. Part B: Cybernetics 31 (2) (2001) 227–234. [23] E. Herrera-Viedma, Modeling the retrieval process of an information retrieval system using an ordinal fuzzy linguistic approach, Journal of the American Society for Information Science and Technology 52 (6) (2001) 460–475. [24] E. Herrera-Viedma, An information retrieval system with ordinal linguistic weighted queries based on two weighting elements, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 9 (2001) 77–88. [25] E. Herrera-Viedma, S. Alonso, F. Chiclana, F. Herrera, A consensus model for group decision making with incomplete fuzzy preference relations, IEEE Transactions on Fuzzy Systems 15 (5) (2007) 863–877. [26] E. Herrera-Viedma, F. Chiclana, F. Herrera, S. Alonso, Group decision making model with incomplete fuzzy preference relations based on additive consistency, IEEE Transactions on Systems, Man and Cybernetics. Part B: Cybernetics 37 (1) (2007) 176–189. [27] E. Herrera-Viedma, O. Cordón, M. Luque, A.G. López, A.M. Muñoz, A model of fuzzy linguistic IRS based on multi-granular linguistic information, International Journal of Approximate Reasoning 34 (3) (2003) 221–239. [28] E. Herrera-Viedma, F. Herrera, F. Chiclana, A consensus model for multiperson decision making with different preference structures, IEEE Transactions on Systems Man and Cybernetics Part A: Systems and Humans 32 (3) (2002) 394– 402. [29] E. Herrera-Viedma, A.G. López-Herrera, A model of information retrieval system with unbalanced fuzzy linguistic information, International Journal of Intelligent Systems 22 (11) (2007) 1197–1214. [30] E. Herrera-Viedma, A.G. López-Herrera, M. Luque, C. Porcel, A fuzzy linguistic IRS model based on a 2-tuple fuzzy linguistic approach, International Journal of Uncertainty Fuzziness and Knowledge-Based Systems 15 (2) (2007) 225– 250. [31] E. Herrera-Viedma, A.G. López-Herrera, C. Porcel, Tuning the matching function for a threshold weighting semantics in a linguistic information retrieval system, International Journal of Intelligent Systems 20 (9) (2005) 921–937. [32] E. Herrera-Viedma, L. Martı´nez, F. Mata, F. Chiclana, A consensus support system model for group decision-making problems with multi-granular linguistic preference relations, IEEE Transactions on Fuzzy Systems 13 (5) (2005) 644–658. [33] E. Herrera-Viedma, G. Pasi, A.G. López-Herrera, C. Porcel, Evaluating the information quality of web sites: a methodology based on fuzzy computing with words, Journal of the American Society for Information Science and Technology 57 (4) (2006) 538–549. [34] E. Herrera-Viedma, E. Peis, Evaluating the informative quality of documents in SGML-format using fuzzy linguistic techniques based on computing with words, Information Processing and Management 39 (2) (2003) 233–249. [35] M.H. Hsu, A personalized English learning recommender system for ESL students, Expert Systems with Applications 34 (2008) 683–688. [36] R.R. Korfhage, Information Storage and Retrieval, Wiley Computer Publishing, New York, 1997. [37] T.-P. Liang, Recommendation systems for decision support: an editorial introduction, Decision Support Systems 45 (2008) 385–386. [38] J. Malinowski, T. Weitzel, T. Keim, Decision support for team staffing: an automated relational recommendation approach, Decision Support Systems 45 (2008) 429–447. [39] G. Marchionini, Research and development in digital libraries. . (accessed 04.09). [40] L. Martı´nez, M. Barranco, L.G. Pérez, M. Espinilla, A knowledge based recommender system with multigranular linguistic information, International Journal of Computational Intelligence Systems 1 (3) (2008) 225–236. [41] L. Martı´nez, L.G. Pérez, M. Barranco, M. Espinilla, Improving the effectiveness of knowledge based recommender systems using incomplete linguistic preference relations, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 16 (2) (2008) 33–56. [42] L. Martı´nez, L.G. Pérez, M. Barranco, A multi-granular linguistic based-content recommendation model, International Journal of Intelligent Systems 22 (5) (2007) 419–434. [43] A.W. Neumann, Recommender Systems for Information Providers: Designing Customer Centric Paths to Information, Physica-Verlag, Heidelberg, 2009. [44] G. Pasi, Modeling users’ preferences in systems for information access, International Journal of Intelligent Systems 18 (7) (2003) 793–808. [45] C. Porcel, A.G. López-Herrera, E. Herrera-Viedma, A recommender system for research resources based on fuzzy linguistic modeling, Expert Systems with Applications 36 (2009) 5173–5183. [46] C. Porcel, J.M. Moreno, E. Herrera-Viedma, A multi-disciplinar recommender system to advice research resources in university digital libraries. Expert Systems with Applications, doi:10.1016/j.eswa.2009.04.038. [47] L.M. Quiroga, J. Mostafa, An experiment in building profiles in information filtering: the role of context of user relevance feedback, Information Processing and Management 38 (2002) 671–694. [48] M.E. Renda, U. Straccia, A personalized collaborative digital library environment: a model and an application, Information Processing and Management 41 (2005) 5–21. [49] P. Reisnick, H.R. Varian, Recommender systems, Special issue of Communications of the ACM 40 (3) (1997) 56–59. [50] L. Ross, P. Sennyey, The library is dead, long live the library! The practice of academic librarianship and the digital revolution, The Journal of Academic Librarianship 34 (2) (2008) 145–152. [51] B. Sarwar, G. Karypis, J. Konstan, J. Riedl, Analysis of recommendation algorithms for e-commerce. in: Proceedings of ACM E-Commerce 2000 Conference, 2000, pp. 158–167. [52] L.A. Zadeh, The concept of a linguistic variable and its applications to approximate reasoning, Part I. Information Sciences 8 (1975) 199–249; L.A. Zadeh, The concept of a linguistic variable and its applications to approximate reasoning, Part II, Information Sciences 8 (1975) 301–357; L.A. Zadeh, The concept of a linguistic variable and its applications to approximate reasoning, Part III, Information Sciences 9 (1975) 43–80. C. Porcel, E. Herrera-Viedma / Knowledge-Based Systems 23 (2010) 32–39 39