A genetic algorithm based method for university course timetabling problems and application in Hanoi Open University

In this article, we review the timetabling problem especially in the credits training at universities. We also study genetic algorithms as a useful tool in soft computing to solve complex problems with many constraints and multi-objective. Therefore, the paper has proposed methods for solving timetabling problem in credits training at universities based on genetic algorithms. Some of the constraints of the problem have been considered for using the fuzzy parameters of hedge algebras to meet the actual requirements. The results of experiments in practice at the Faculty of Information Technology - Hanoi Open University show the effectiveness of the method. That’s the running time of the algorithm is quite fast, compared to the traditional solution, we have to change a lot of times to obtain the final results of timetable, by applying this results of our method, we have a stable timetable and put into student for registration. Moreover, the result shows that the timetable gives opportunity for students to register well, besides the reasonable suggestions are output from the algorithm. Compared to the prior semesters, the rate of canceled courses is lower. These demonstrated the potential effectiveness of the algorithm in practical application. The method proposed in the paper can be extended to apply for practicing in many situations. However, the hard and soft constraints may put more different assessments to show the suitable of each assessment, especially based on fuzzy parameters with more suitable for the purpose of actual use, thereby potentially increasing the efficiency up. These will be studied further and announced in the next research.

pdf17 trang | Chia sẻ: huongthu9 | Lượt xem: 435 | Lượt tải: 0download
Bạn đang xem nội dung tài liệu A genetic algorithm based method for university course timetabling problems and application in Hanoi Open University, để tải tài liệu về máy bạn click vào nút DOWNLOAD ở trên
Journal of Computer Science and Cybernetics, V.32, N.4 (2016), 285–301 DOI no. 10.15625/1813-9663/32/4/7962 A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE TIMETABLING PROBLEMS AND APPLICATION IN HANOI OPEN UNIVERSITY DUONG THANG LONG Faculty of Information Technology, Hanoi Open University, Vietnam duongthanglong@gmail.com  Abstract. Timetabling problems is one of very significant problems in many fields of applications. As mentioned in [3], this problem is NP-complete with numerous factors and constraints. In general, it is treated as a multi-objective optimization problem. Currently, the work of scheduling is very difficult in universities, especially in credits training. Almost it is impossible to control all cases of the problem by human. Therefore, we cannot manually give an effective solution for this problem. There are quite many methods to resolve this problem in literature, they are mostly searching methods based on genetic algorithms and their results are proved effective in practice. In this paper, we propose a method based on genetic algorithms for university course timetabling problems with some modifications and apply it to real-world datasets in Hanoi Open University. Keywords. University course timetabling, genetic algorithms. 1. INTRODUCTION The university timetabling is a typical scheduling problem. However, it is more complex factors than the form of conventional scheduling problems, such as teachers, students, time- slots, classrooms, credit classes or courses, and especially among the major constraints of these elements. More generally, the timetabling problem including many relevant factors should be considered, such as examinations, practice, lecture halls, etc. [3]. As mentioned in [3], Even and Itai showed that the scheduling problem is a NP-complete problem. Typically, scheduling problems are conducted in the traditional way, by intuitive and direct calculation of human. Currently, due to the diversity and the many changes of the binding between the elements in the problem, which are limited resources and the complexity of the factors, the scheduling problem often takes a lot of time and labor. So the use of computers to perform scheduling, not only to show the research interests of the authors, but also allows achieving superior results despite more constraints. Obviously, this leads to saving a lot of time and effort. Scheduling problems and methods of solving the problem have been researched from 1960. In [7], pointed out that, Hertz has proposed using Tabu search method to solve scheduling problems including two stages (TATI / TAG) and further pointed out that this is the appropriate way to solve school schedule and calendar with large-scale implementation. Genetic Algorithm (GA) was applied to solve the scheduling problem in universities in order to overcome the limitations of traditional methods. Optimization algorithm ants (ACO) are c© 2016 Vietnam Academy of Science & Technology 286 DUONG THANG LONG mentioned by Nothegger et al. for using to solve this hybridisationproblem. Meanwhile, Tassopoulos and Beligiannis’s method of swarm optimization establishes a timetable for the various schools in Greece. Al.Betar et al. proposed a method of hybrid (HHS) to solve scheduling problems for the university. HHS integrated algorithm optimization and climbing hills swarm to balance the elements of exploration and search. The authors, in [6], indicated that the genetic algorithm (GA) is a suitable method that can be used to identify, solve an optimization problem, especially the scheduling problem. The authors focused on the scheduling problem of the university, it can be divided into school schedule and exam scheduling. Accordingly, the method based on improved GA to solve this problem with changing the filter selection, hybridization and mutation to achieve high efficiency. The authors also used an alternative strategy, in order to avoid falling into local optimum. M. Abbaszadeh et al. in [1] using GA to solve scheduling problems in the universities, they have changed the structure of performing gene sequence, allowing genetic mutations to achieve the transferring 15% of better individuals to the next generation. Moreover, to avoid falling into local optimum, they considered the impact their parameters mutation, removed repetitive genes and replaced by better gene sequences. The result provides high efficiency and maximizes accuracy for binding. The authors of [8] proposed a genetic algorithm with the binding elements of the problem using fuzzy measure (fuzzy) in order to adapt to the practical constraints, such as the requirements of the faculty for time, teaching expertise. However, these authors mainly focus on a type of the problem which is all student fixed in each class. Since, we have to assign each class to some courses so the students of the class belong to those courses. This kind of the problem only can be applied to traditional training which does not allow flexibility of student learning. In opposite, in credit training we can allow student choosing courses, time-slots, even teachers for learning but the above methods cannot solve this kind of the problem. So, in this paper, we propose methods for solving scheduling problems for credit course timetabling problem using genetic algorithms with some constraint parameters using fuzziness of hedge algebra and added temperature factor to influence the genetic operations to bring high efficiency. The article consists of Part 1 as an introduction to the use of genetic algorithms for solving scheduling problems. Part 2 discussing in detail the genetic algorithms and scheduling problems in the education of university credit courses. Part 3 proposes a method for scheduling with parameters binding with electronic dimming processing based genetic algorithms mentioned in section 2. Section 4 is to build the program and test software at Hanoi Open University, reviews results. Finally is the conclusion. 2. PROBLEM FORMULATION In general, scheduling problem in universities (course timetabling problem - CTP) in- cludes finding the appropriate allocation of time within a limited time period for all events (such as courses, semester exams) and assigns them to a number of resources (teachers, stu- dents and classrooms) to ensure that the constraints are met [1, 2, 3, 4, 6]. However, in credit training, depending on the characteristics of each university, the scheduling problem will be deployed with certain differences. In this paper, we use resources including teachers A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 287 and classrooms. Remaining student factor will engage in the objective function calculations, which mentioned later. This would be appropriate to the case that the timetable of courses will be implemented before students enroll. According to the characteristics of each university and the organization of credit training, for convenience in the design of algorithms, in this paper we assume that the steps for holding the problem as follows: at the beginning of each semester, based on the ability of registering to subjects, we propose all courses from subjects, then we schedule or make a timetable for the courses and announced to students in order to register. Each course consists of its name, subjects, type of course-room (either small room or hall room or practice room). Scheduling problem now becomes the work assigned to teachers, time-slots, classrooms for each course proposed that satisfies the given requirements and constraints. Typically, the constraints of the problem needed to be satisfied are divided into two categories [1, 4, 9]: hard constraints and soft constraints. Hard constraints must certainly be met and fulfilled. It can be the following requirements: (H1) Not allowed to allocate the resources (teachers, classrooms) for various events (courses) in the same time. (H2) The classroom assigned to an event (course) must be within the appropriate re- sources for the event. Accordingly, an event was held in a room must have appropriate infrastructure to be able to organize events, such as computer labs, hall classrooms, etc. (H3) An event (course) can only be assigned to a teacher if he or she has sufficient knowledge and capability to respond to the teaching expertise of that course. (H4) Lecturers are assigned to teach only in the present time-slots to work at the school, and will be determined before making timetable. On the other hand, the soft constraints are desired to be satisfied as highly as possible and to meet as many constraints as possible, but it is not sufficient for a solution of problems. Some soft constraints are usually considered in the study and implementation of the following: (S1) Assign teachers to the courses so that teaching ability of teachers is as high as possible. (S2) Priority assigned timeslot to the courses with the teachers that the teacher’s desires is met. (S3) Ensure balance number of course for each teacher, i.e., the minimum and maximum number of courses should be taken. (S4) It should be given priority courses with prerequisite of a subject to the same timeslot. This aims to increase the ability to enroll for more students on the timetable. (S5) The ability of students enrolling on the timetable is as high as possible. This special meeting of the given type of credit training, as above, which the courses are firstly assigned time-slots, teachers and classrooms, then students will register to this timetable of courses, rather than a traditional timetable with students are fixed in each course. For credits training, we do not need to design curriculums with the fixed mandatory subjects of a term, instead, we always design a diagram of pre-requisite subjects for curricu- lums. Then, students want to choose subjects for learning in a term, he/she must be passed all pre-requisite subjects belonging to the chosen subjects. So the constraints S4, S5 have significant meanings. The S4 constraint shows that when a timetable with more subjects and theirs pre-requisite is at the same timeslot, students have more chance to choose subjects for learning. The S5 constraint indicates that the more chosen subjects, the less time to 288 DUONG THANG LONG complete learning at school. In addition, when we give a timetable with more chance for choosing subjects by learners, we may have more students at school in a term, then the school financing may increase, teachers have more teaching time, maximum facilities are used,... In addition, depending on the particular credit training and the requirements of each university, the soft constraints can be changed, choose additional factors for properly reality. Now we describe the input data for the scheduling problem and formalized as an optimization problem model. In this model, we use the following symbols: nR −Number of classrooms, {R1, R2, ..., RnR} denotes the set of classrooms nC −Number of events (courses), {C1, C2, ..., CnC} denotes the set of courses nL −Number of lecturers, {L1, L2, ..., LnL} denotes the set of lecturers. nS −Number of students, {S1, S2, ..., SnS} denotes the set of students nT −Number of time slots, {T1, T2, ..., TnT } denotes the set of time slots. Normally, the university timetable should be set weekly time-slots, such as morning Monday, afternoon Tuesday, morning Thursday,... of the week. Here we assume 6 school days of a week from Monday to Saturday, with two sessions in every day which is morning or afternoon, so we have 12 training sessions a week. However, we can divide each day into many period of time and the time-slots can be more than this. However, we can easily satisfy (H3), (S1) and (S3) constraints directly by assigning lecturers to every course based on experts and specialized users. Because the H3 and S1 constraints can be taken into account, each teacher is manually assigned to some courses by users, since we also easily satisfy H3 and S1 constraints. Similarly, the S3 constraint can be easily satisfied by user based on number of courses assigned to each teacher. This would have the potential to speed up the convergence of methods, because of reducing the search space. The H1 constraint will be taken to meet during scheduled. The remaining constraints can be represented by the matrix, which are called matrix constraints, as following: (H2) Matrix C × R = {CRi,j |i = 1, nC , j = 1, nR } defines each course can only be assigned to which classrooms. The value of this matrix is {0, 1}, 1 denotes that it can be assigned and 0 is not. (H4, S2) Matrix L×T = {LTk,l|k = 1, nL, l = 1, nT } defines each teacher may be present and teach at the school in the time-slots. We integrate H4 into S2 constraint, which is a priority in the timeslot as high as possible. Therefore, this matrix will receive value in the form of language. For example, NO, NORMAL, GOOD, VERY GOOD,.. The NO value denotes a teacher will not be present during at that timeslot. (S4) Matrix C × C = {CCi1,i2 |i1 = 1, nC , i2 = 1, nC} describe prerequisite subjects between courses, it will receive the binary value 0 if there is no prerequisite subjects or 1 if there is prerequisite subjects of them. The courses with prerequisite subjects should be in priority for the same timeslot in order to increase registered ability of students. (S5) Matrix S × C = {SCm,i|m = 1, nS , i = 1, nC} describes constraints on the ability of students registered to the courses, receiving binary values 1 or 0, respectively, may be registered or not. Now we represent the timetable as a table of the corresponding column is the courses, the corresponding rows are lecturers, times lots, classrooms are as follows: In which, as mentioned, we assigned lecturers to every courses to certainly satisfy re- quirements of the H3 constraint, this is also for balancing the number of courses for every teacher by users. However, a course can only continue running if the number of registered A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 289 Table 1. Representation of the timetable Courses C1 C2 .. Ci CnC Lecturers Lk1 . . . Lki Timeslots Tl1 . . . Tli Classroom Rj1 . . . Rji students is enough large. This is a dynamic factor, so universities should take somehow to ensure that as maximum courses are expected to be deployed as possible. Thus said the problem could be condensed in a shorten form, which distributes set of time-slots {T1, T2, ..., TnT } and set of classrooms {R1, R2, ..., RnR} on a table so that the full satisfaction of the hard constraints H1, H2, H4 and it can be reached the soft constraints S2, S4, S5 as high as possible. According to the representation of a timetable in Table 1, the hard constraints H1, H2, H4 can be defined as follows: (H1) No exist two columns (Ci1, Ci2) which pair values in two rows of teachers and either classrooms or time-slots is identical, ie: ∀(Ci1 6= Ci2) : (Lki1 , Tli1) 6= (Lki2 , Tli2) ∧ (Rji1 , Tli1) 6= (Rji2 , Tli2). (H2) No exists columns that pair values of course and corresponding room in C × R matrix equal to zero, ie: ∀(Ci, Rji) : CRi,ji 6= 0. (H4) No exists columns which pair values of lecturers and corresponding time-slots in L× T matrix equal to zero, ie: ∀(Lki , Tli) : LTki,li 6= 0. We apply hedge algebras based fuzzy parameters to S2 soft constraint for calculating satisfying of the problem, whereby each teacher will be defined a priority set by the terms of hedge algebras [5]. For example, lecturer Lk select the preferred time-slots by using 3 terms of hedge algebras which including {low, normal, high}. The representative function of priority time-slots for lecturers treating as triangles with the top vertex is the center (υ(x)) of x, two remaining vertices is selected from the center of the two beside time-slots in the sequences. The figure 1 illustrates a teacher chooses morning Tuesday at a normal, morning Wednesday at a high, and low for afternoon Wednesday, the remaining time-slots are not chosen, i.e., the teacher is not at university for that time. We denote that the priorities time-slots of teachers to p(t), the function for representing of scheduled time-slots of teachers is r(t), then the evaluated function of time-slots satisfying for teachers, which can be used by min operator as following: s(t) = min{r(t), p(t)}. The Figure 2 illustrates triangular function p(t) (Figure 2.a) which is chosen by teachers, the function r(t) represents scheduled time-slots (Figure 2.b), and s(t) is the intersection of the two above functions (Figure 2.c). 290 DUONG THANG LONG Figure 1. Selection of priority-time-slots of a teacher Figure 2.a. time-slots chosen by teachers Figure 2.b. time-slots scheduled for Figure 2.c. time-slots satisfying evaluated of timetable Figure 2. Evaluation function for time-satisfaction level of teachers A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 291 We now can assess the degree of time-satisfaction of teachers for all time, denote by Lk, as following ([7]): F 2k = ∫∞ 0 s(t)dt∫∞ 0 r(t)dt . The S4 soft constraint can easily count how many pairs of courses, where one of them is prerequisite of another according to C ×C matrix and the courses is at the same scheduled time, called x is the number of counts, then this would be highly bound rates are as follows: F 4 = 1− 1√ x2 + 1 . The S5 soft constraint is evaluated based on the capability of registering students accord- ing to the register matrix S ×C and the timetable which is scheduled. For each student Sq, we build a plan for registering on the timetable, this may be applied the optimal method to build, called yq is the number of registered courses by student Sq. Notice that each subject can be established many courses, so students cannot enroll more than one course of each subjects. To ensure compliance with regulations and consistent practice, the most appro- priate value yq is about 4 to 7 in one semester and should not exceed 10 subjects, whereas evaluation for this form of binding the following trapezoidal: F 5q =  yq − 1 3 , 1 ≤ yq ≤ 4 1 , 4 ≤ yq ≤ 7 10− yq 3 , 7 ≤ yq ≤ 10 0 , else. Model of the problem can be stated as a formalization of multi objective optimization problems as follows:( nL∑ k=1 F 2k ) → max, F 4 → max,  nS∑ q=1 F 5q → max subject to (H1.a) nC∑ i=1 zi,l,j ≤ 1, l = 1, nT , j = 1, nR. (H1.b) nC∑ i=1 wi,k,l ≤ 1, k = 1, nL, l = 1, nT . (H2) nT∑ l=1 nR∑ j=1 CRi,j · zi,l,j = 1, i = 1, nC . (H4) nL∑ k=1 nT∑ l=1 ω (LTk,l) · wi,k,l = 1, i = 1, nC . which zi,l,j is a binary variable defined the course Ci is assigned to time-slot Tl and classrooms Rj if its value is 1 or otherwise ω(.) is a function to determine a value of LT matrix is NO 292 DUONG THANG LONG (ω = 0) or not (ω = 1), wi,k,l is a binary variable determining the course Ci is assigned to teacher Lk and time-slot Tl if its value is 1 or otherwise. In the next section we will design optimization search algorithm to the scheduling problem based on genetic algorithms. 3. GENETIC ALGORITHM BASED METHOD FOR CREDIT COURSE TIMETABLING PROBLEMS In this section, an algorithm based on genetic algorithm (GA) for scheduling problem as mentioned in section 2 is introduced. The method based on GA optimization parameters has combined to temperature evolution control, i.e., it will participate in the genetic operations to increase the convergence of the algorithm, as well as restrictions fall into local optimum [5]. GA is known as a repeating search procedure and widely used in solving optimization problems, this method is based on the basis of biological evolution. Some candidate solutions of the problem will be maintained in iterations of evolution. Genetic operators such as mutation and crossover are applied to evolve the solution in hopes of finding a good solution and at the same time have a high chance to maintain into the next generation. First, this approach needs to encode solutions of the problem into an appropriate gene sequence which can apply appropriate genetic operators to. There are two different approaches to this encoding [8], which is directly encoding and indirectly one. For direct encoding, all the properties of the solution as courses, teachers, time-slots, classrooms, etc., are directly encoded in the gene sequence of the gene (chromosome). However, direct encoding method may risk making an invalid solution, i.e., the hard constraint violations since apply genetic operations to generate offspring from parent. Another way is the indirect encoding method, with each gene sequence will need a procedure to build the suitable timetable from there. For example, a genetic sequence (chromosome) usually represents a sequence of events that are put on the timetable based on a given procedure. 3.1. Gene sequence encoding for a solution of the problem The encoding method may also use one of three methods, which is binary encoding with each gene is represented binary value parameter of the schedule, or integer encoding or real encoding. Authors, in [5], show that there are no specific directions for using the type of encoding scheme in the specified problem rather, it depends upon the applicability and the requirements of the problem, but the real encoding is more used than binary encoding or integer encoding for function optimization problems. In this paper, we use a method of real encoding and direct form. To overcome the case of a solution encoding genes fall into the hard constraint violations, we use a penalty in fitness function of solutions, which will be described later. We encode all parameters of the timetable given in table 1 to a chromosome. So a chromosome has nC × 3 length of gens. In the case of easier, without loss of generality, we can assign teachers to every course in the timetable by experts or users, and it now need to be scheduled (assigned time-slots, classrooms) for all courses. We have just designed a solution gene sequence corresponding to table 1. The length of chromosome is now nC × 2, as following: A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 293 Figure 3. Chromosome of gene encoding of solution The value of each gene (gi) of this encoding is limited to [0, 1], thereby to determine the value of real domain corresponding of time-slots as well as classrooms as a function as follow: fT : [0, 1]→ [1, nT ], fT (gi) = dgi · nT e fR : [0, 1]→ [1, nR], fR(gi) = dgi · nRe in which, symbols d·e to get the nearest upper integer of values. 3.2. Fitness function (for each individuals corresponding gene sequence) The fitness function of a solution depends on the binding hard constraints and soft con- straints. As mentioned, the direct encoding method, the hard constraints will be evaluated as a coefficients penalty in fitness function to avoid violations hard constraints of individu- als. Soft constraints are in the partly qualitative form, they are vague values and difficult to be quantified precisely and there are some cause and effect relationships between them, which we can describe as fuzzy rules. Then, we use fuzzy logic to argue, assessment of soft constraints and define a proper function for each soft constraint. According to the analyzed model of the problem, in the second part, we have to maximize three functions F 2k , F 4, F 5q corresponding to soft constraints S2, S4 and S5 as high as possible. However, in GA, it is usual converted to minimize these functions. We give a function to minimize as goal of problem as following: FS = w2 · ( 1 NL nL∑ k=1 (1− F 2k ) ) + w3 · (1− F 4) + w4 · ( 1 NS nS∑ q=1 (1− F 5q ) ) which, the values w1, w2, w3 to weight the components of the objective function. To set the penalty coefficient in the objective function for evaluating the degree of constraint violation hard constraints, we use the following function: FH = 1 1 + c(H1) + c(H2) + c(H4) c(.) is a function to count the number of violations of hard constraints H1, H2, H4 and it is calculated as follows: c(H1) = ‖{(zi1,l,j × zi2,l,j) = 1 : i1 6= i2}‖+ ‖{(wi1,k,l × wi2,k,l) = 1 : i1 6= i2}‖ , c(H2) = ‖{(CRi,j = 0, zi,l,j = 1)}‖ , 294 DUONG THANG LONG c(H4) = ‖{(LTk,l = 0, wi,k,l = 1)}‖ , Finally, we build a fitness function of individual corresponding to the gene sequence weighted by sum of weighting of FH and FS fitness =w1 · ( 1− 1 1 + c(H1) + c(H2) + c(H4) ) + w2 · ( 1 NL nL∑ k=1 (1− F 2k ) ) + w3 · (1− F 4) + w4 · ( 1 NS nS∑ q=1 (1− F 5q ) ) . This fitness is a smooth function, so it can give well evolution in GA. So, the hard constraints can be met and fulfilled by setting the weight of FH much greater than the other (FS), then soft constraints can be satisfied after that. 3.3. The genetic operations In this article, we use the genetic operations are integrated temperature Tk as an addi- tional parameter (where k is the index of current generation), called a temperature genetic al- gorithm [19]. The probability for the selection and mutation operations are changing through each generation, they are scaled to the temperature parameters of current generation. The initial temperature parameter is calculated based on the number of generations (often quite large to ensure the diversity of the population). After each generation, the temperature parameters will be descending to ensure the convergence and stability of algorithms. We have to calculate genetic operations for generate new population. As in [5], the se- lection operation is nonlinear ranking of exponential function, the individuals can be sorted by decreasing order of its fitness values, individual i (ranked ith) will be selected in popula- tions parents as the probability is calculated based on the current temperature parameters of the evolution. In crossover operation, randomly selects one of three evenly distributed as a cutoff point crossover, linear crossover and extended linear crossover. The mutation and reproduction operations are calculated as in [5] to create new generation for evolution. The genetic operations are as follows: (a) Selection: we use the exponential nonlinear ranking for selection, individuals of a pop- ulation are ranked in decreasing order of Fitness, the ith ranked individual will be selected for parent with the following probabilities: p = ((1-a).a−i)/(aN pop-1), a = 1+γ(Tk)/Npop, γ(Tk) = 1+(γmax – 1).(ln(T )-ln(Tk))/(ln(T )-ln(Tend)), where, Npop is the number of individuals in population, Tk = T0, α k is the current tem- perateness of the present generation k (k = 1,...,Gmax), this parameter decreases from the initial temperature T to Tend = T0.α Gmax (0 < α < 1, usually chosen α = 0.7), Gmax is the number of evolutionary generations. The function γ(Tk) increases linearly by the number of generations that have evolved from 1 to γmax (usually chosen γmax = 0.9). (b) Crossover: uniform randomly selecting one of three following crossover operations: (b.1) Single point cross over: for an individual X, X|i denotes the first sequence of gens on X taking into account the position i and i|X is the rest of gene sequence on X (from (i+1)th gen to the end). Given two parents X and Y , an uniform randomly selected ith position on A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 295 two parents, since two off springs U and V were generated by permutation on two parts of X and Y as followed: U = X|i ⊕i |Y, V = Y |i ⊕i |X, where, ⊕ denotes concatenate two gen sequences. (b.2) Linear arithmetic crossover: randomly selected a ∈(0,1), two off spring U and V are generated from two parents X and Y as follows: U [j] = a.X[j] + (1− a).Y [j], V [j] = (1− a).X[j] + a.Y [j], j = 1, ..., Lidv where X[j] is the jth gen of X, Lidv is the length of gen sequences. For convenience, we write in the form U = a.X + (1− a).Y and V = (1− a).X + a.Y. (b.3) Extended linear arithmetic crossover: uniform randomly selected a position ith then it divides each gene sequence of the parent X and Y into two parts, applying the linear arithmetic crossover (b.2) on each of the subgroups as follows: U |i = a.X|i + (1− a).Y |i and i|U = (1− a).i|X + ai|Y, V |i = (1− a).X|i + a.Y |i and i|V = ai|X + (1− a).i|Y , U = U |i ⊕i |U and V = V |i ⊕i |V. where, a is uniform randomly selected as in (b.2). (c) Mutation: suppose a gene whose value denoted by X[i] is inlimited range [Lc, Ld] is chosen for mutation, then the new value of mutated gen is calculated by X[i]′ = X[i] + z.(X[i]− Lc) if u < 0.5, = X[i] + z.(Ld −X[i]) if u ≥ 0.5, where u randomly selected in [0, 1], and z = sign(u− 0.5).Tk((1 + 1/Tk)|2u−1| − 1), Tk is the temperature of the kth generation. (d) Replacement: is the method of parent substitution by offspring, each offspring will compete with the best parent. Call gp1, gp2, gc the fitness values of the parents and a child, g∗ = max {gp1, gp2}, then the child is accepted with probability p = min {1, e−(gc−g∗)/Tk}. In the case of unacceptable child, the parent corresponding to g∗ is accepted for the next generation. The next section we develop a computer program for the proposed approach, tested on a data set of examples and real data sets in the Faculty of Information Technology - Hanoi Open University. 4. EXPERIMENTS RESULTS The computer program is built in C/C++, compiled on Windows 8 platform. The configuration of computer to run experiments on PCs with processor speed of 3.3GHz, 4G RAMS. 4.1. Experimenting with sample problems Data of the sample problem is given as follows: 296 DUONG THANG LONG Table 2. The parameters of the test sample data Parameters Meaning nC = 5 Three courses {C1, C2, C3} nL = 2 Two lectures {L1, L2} nR = 2 Two classrooms {R1, R2} nT = 3 Three time-slots {T1, T2, T3} nS = 20 Twenty students for registering As proposed, the teachers are predefined to teach courses such as Table 3. This will remove the checking constraint H1 and reduce the search space of GA optimization, leading to speed up the convergence of the algorithm. Table 3. Determining teachers to the courses Courses C1 C2 C3 Teachers L1 L2 L1 In this experiment, we leave out the C ×C matrix, corresponding weighting components of fitness function is w3 = 0, as the criteria of courses with prerequisite conditions that they may have same time-slots in the time table as much as possible. Since this constraint is also expressed in considering the goal of increasing student registration at S5 soft constraint. The remaining constraints are reflected in following matrices: Table 4. Constraints between courses and classrooms (C ×R) C1 C2 C3 R1 1 1 R2 1 Table 5. Constraints between teachers and time-slots (C × T ) T1 T2 T3 L1 low high normal L2 normal high Table 6. Constraints between students and courses (S × C) Students 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 C1 1 1 1 1 1 1 1 1 1 1 C2 1 1 1 1 1 1 1 1 1 1 C3 1 1 1 1 1 1 1 1 1 The experiment running with the proposed method, we manually choose GA parameters based on experiments as the following table: A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 297 Table 7. Parameters for running experiments Parameters Values α −descending temperature of scaling 0.7 γmax - maximum temperature factor 9 Npop - population size 50 Gmax - number of generations 10 pc - crossover probability 0.9 pm - mutation probability 0.1 w1 0.9 w2 0. w3 0. w4 0.1 In this experiment, we do not use the soft constraints between teachers and time-slots, the soft constraints between courses and course by prerequisite conditions, corresponding w2 and w3 respectively. The results of experiment running with the given data is in the follow- ing timetable, in which all hard constraints are satisfied, the soft constraints are evaluated according to fitness function value reached 0.886667. Note that Table 5 shows constraints between teachers and time-slots (L × T ) that there is only one timetable which met these constraints (see below table). Table 8. The timetable result of the experiment Courses C1 C2 C3 Teachers L1 L2 L1 Time-slots T1 T2 T3 Classrooms R1 R2 R1 As above result, the possibility of registration of students can achieve is 29 out of its original capacity of 29 (reached 100%). 4.2. Experiment with real-world problems Currently, in the Faculty of Information Technology - Hanoi Open University, we establish three semesters in a year with two main semesters and one optional semester. We have more than 1000 students. The number of subjects in curriculum is 42 and it has many of requisite constraints of subjects. We also use system management software for supporting the list of all students with ability enrolled in each subject at the beginning of a semester. We have 8 classrooms of theory with a capacity about 70 students per room, and also have 3 computer labs for practicing with holding 35 students per room, 2 halls holding 140 students. We use weekly time-slots, so it has 7 days for learning (both Saturday and Sunday), 3 time-slots per day (morning, afternoon and evening, except for Thursday night), so the total number of time-slots is 20. The data in the experiment is for the first semester (HK1) of 2015-2016 school years. Based on the statistical capabilities of the 1044 student registration, the expected open 298 DUONG THANG LONG courses are 86 corresponding 36 subjects. The constraints matrix is set up consisting of C × R, L × T , S × C (as too big to detail here). The teachers are predefined to courses depend on their expert by users. The parameters for running is same as Table 7, but some parameters are properly changed, such as Npop = 250, Gmax = 1000, and weights of fitness function are w1 = 0.9, w2 = 0.01, w3 = 0, w4 = 0.09. We run the experiment in three times. Result of the first time is showed in Table 12. In which, the symbol “S” is the morning, “C” is the afternoon of time-slots with suffixes indexed is day of week (from 2 to 7 corresponding Monday to Saturday respectively), teachers index, courses index and classrooms index are #L, #C and #R column respectively. This timetable gives a total ability of all registering students is 4697 respectively, equivalent to 4.5 courses per student (approximately 14 credits per student), it is properly with minimum credits of regulations. We use the result of first run for implementing in practice to students enrolled and results are 87.2% success of courses (i.e. the number of courses was canceled because of low student enrollment of 12.8%). This no success is depending on student’s registration, we give students the suggestions for registration. Compared to the previous semesters, this no success of rate is quite low, such as in 2014-2015 school year, the no success rate of 3rd semester (HK3) is 27.4%, 2nd semester (HK2) is 18.95%, 1st semester (HK1) is 14, 6% (see Figure 3). This shows that the timetable generated by our method with registration suggested can increase the capability of students registering, leading to reduction of canceled courses than before. On the other hand, as mentioned, this is a random search algorithm, we run three times with the average time is 185 seconds per each run. The graph in Figure 4 shows the fitness value of the best individuals in each generation and the hard constraints are violated through evolution of the algorithm in Figure 5. In Figure 5, it also shows that we reach a timetable with no violate hard constraints at about 700th generation of evolution. This assures the timetable of results which has no violations hard constraints. 5. CONCLUSION In this article, we review the timetabling problem especially in the credits training at universities. We also study genetic algorithms as a useful tool in soft computing to solve complex problems with many constraints and multi-objective. Therefore, the paper has proposed methods for solving timetabling problem in credits training at universities based on genetic algorithms. Some of the constraints of the problem have been considered for using the fuzzy parameters of hedge algebras to meet the actual requirements. The results of experiments in practice at the Faculty of Information Technology - Hanoi Open University show the effectiveness of the method. That’s the running time of the A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 299 Table 12. Timetable of 1st experiment running 300 DUONG THANG LONG Figure 3. The successful rate of courses in the timetable given by proposed method Figure 4. The changing fitness of individuals through evolutionary time Figure 5. Hard constraints violations through evolutionary time A GENETIC ALGORITHM BASED METHOD FOR UNIVERSITY COURSE 301 algorithm is quite fast, compared to the traditional solution, we have to change a lot of times to obtain the final results of timetable, by applying this results of our method, we have a stable timetable and put into student for registration. Moreover, the result shows that the timetable gives opportunity for students to register well, besides the reasonable suggestions are output from the algorithm. Compared to the prior semesters, the rate of canceled courses is lower. These demonstrated the potential effectiveness of the algorithm in practical application. The method proposed in the paper can be extended to apply for practicing in many situations. However, the hard and soft constraints may put more different assessments to show the suitable of each assessment, especially based on fuzzy parameters with more suitable for the purpose of actual use, thereby potentially increasing the efficiency up. These will be studied further and announced in the next research. REFERENCES [1] M. Abbaszadeh and S. Saeedvand, “A fast genetic algorithm for solving university scheduling problem,” IAES International Journal of Artificial Intelligence, vol. 3, no. 1, p. 7, 2014. [2] Z. Bratkovic´, T. Herman, V. Omrcˇen, M. Cˇupic´, and D. Jakobovic´, “University course timetabling with genetic algorithm: A laboratory excercises case study,” in European Conference on Evolu- tionary Computation in Combinatorial Optimization. Springer, 2009, pp. 240–251. [3] R.-M. Chen and H.-F. Shih, “Solving university course timetabling problems using constriction particle swarm optimization with local search,” Algorithms, vol. 6, no. 2, pp. 227–244, 2013. [4] V. Kolonias, G. Goulas, C. Gogos, P. Alefragis, and E. Housos, “Solving the examination timetabling problem in gpus,” Algorithms, vol. 7, no. 3, pp. 295–327, 2014. [5] R. Malhotra, N. Singh, and Y. Singh, “Genetic algorithms: Concepts, design for optimization of process controllers,” Computer and Information Science, vol. 4, no. 2, p. 39, 2011. [6] A. O. Modupe, O. E. Olusayo, and O. S. Olatunde, “Development of a university lecture timetable using modified genetic algorithms approach,” International Journal of Advanced Research in Com- puter Science and Software Engineering, vol. 4, no. 9, pp. 163–168, 2014. [7] R. Perzina and J. Ramik, “Self-learning genetic algorithm for a timetabling problem with fuzzy constraints,” International Journal of Innovative Computing, Information and Control, vol. 9, no. 11, pp. 4565–4582, 2013. [8] ——, “Timetabling problem with fuzzy constraints: a self-learning genetic algorithm,” Interna- tional Journal of Engineering and Innovative Technology (IJEIT), vol. 3, no. 4, pp. 105–113, 2013. [9] B. Sigl, M. Golub, and V. Mornar, “Solving timetable scheduling problem using genetic algo- rithms,” in Information Technology Interfaces, 2003. ITI 2003. Proceedings of the 25th Interna- tional Conference on. IEEE, 2003, pp. 519–524. Received on March 28 - 2016 Revised on May 04 - 2017

Các file đính kèm theo tài liệu này:

  • pdfa_genetic_algorithm_based_method_for_university_course_timet.pdf