Ali Borji, Ming-Ming Cheng, Qibin Hou, Huaizu Jiang, and Jia Li
Abstract Detecting and segmenting salient objects from natural scenes, often referred to as salient object detection, has attracted great interest in computer vision. While many models have been proposed and several applications have emerged, a deep understanding of achievements and issues remains lacking. We aim to provide a comprehensive review of recent progress in salient object detection and situate this field among other closely related areas such as generic scene segmentation, object proposal generation, and saliency for fixation prediction. Covering 228 publications, we survey i) roots, key concepts, and tasks, ii) core techniques and main modeling trends, and iii) datasets and evaluation metrics for salient object detection. We also discuss open problems such as evaluation metrics and dataset bias in model performance, and suggest future research directions.
Keywords salient object detection; saliency; visual attention; regions of interest
Humans are able to detect visually distinctive, so-called salient, scene regions effortlessly and rapidly in a pre-attentive stage. These filtered regions are then perceived and processed in finer detail for the extraction of richer high-level information, in an attentive stage. This capability has long been studied by cognitive scientists and has recently attracted much interest in the computer vision community, mainly because it helps to find the objects or regions that efficiently represent a scene, a useful step in complex vision problems such as scene understanding. Some topics that are closely or remotely related to visual saliency include: salient object detection [1], fixation prediction [2, 3], object importance [4-6], memorability [7], scene clutter [8], video interestingness [9-12], surprise [13], image quality assessment [14-16], scene typicality [17, 18], aesthetics [11], and scene attributes [19]. Given space limitations, this paper cannot fully explore all of the aforementioned research directions. Instead, we only focus on salient object detection, a research area that has developed greatly in the past twenty years, and in particular since 2007 [20].
Salient object detection or salient object segmentation is commonly interpreted in computer vision as a process that includes two stages: 1) detecting the most salient object and 2) segmenting the accurate region of that object. Rarely, however, do models explicitly distinguish between these two stages (with few exceptions such as Refs. [21-23]). Following the seminal works by Itti et al. [24] and Liu et al. [25], models adopt the saliency concept to perform the two stages simultaneously, as witnessed by the fact that these stages have not been separately evaluated. Further, mostly area-based scores have been employed for model evaluation (e.g., precision-recall). The first stage does not necessarily need to be limited to only one object. The majority of existing models, however, attempt to segment the most salient object, although their prediction maps can be used to find several objects in a scene. The second stage falls into the realm of classic segmentation problems in computer vision, but with the difference that here, accuracy is only determined by the most salient object.
In general, it is agreed that a good saliency detection model should meet at least the following three criteria: 1) good detection: the probability of missing real salient regions and of falsely marking the background as salient should be low; 2) high resolution: saliency maps should have high or full resolution to accurately locate salient objects and retain original image information; and 3) computational efficiency: as front-ends to other complex processes, these models should detect salient regions quickly.
Salient object detection models usually aim to detect only the most salient objects in a scene and to segment the whole extent of those objects. Fixation prediction models, on the other hand, typically try to predict where humans look, i.e., a small set of fixation points [31, 32]. Since both types of model output a single continuous-valued saliency map, in which a higher value indicates that the corresponding image pixel is more likely to be looked at, they can be used interchangeably.
A strong correlation exists between fixation locations and salient objects. Furthermore, humans often agree with each other when asked to choose the most salient object in a scene [22, 23, 26]. See Fig. 1.
Unlike salient object detection and fixation prediction models, object proposal models aim to produce a small set, typically a few hundred or thousand, of overlapping candidate object bounding boxes or region proposals [33]. Object proposal generation and salient object detection are highly related; saliency estimation is explicitly used as a cue in objectness methods [34, 35].
Image segmentation, also called semantic scene labeling or semantic segmentation, is one of the most thoroughly researched areas in computer vision (e.g., Ref. [36]). In contrast to salient object detection, where the output is a binary map, these models aim to assign a label, one out of several classes such as sky, road, and building, to each image pixel.
Figure 2 illustrates the differences between these research themes.
Fig. 1 An example image in Borji et al.'s experiment [26] along with annotated salient objects. Dots represent 3-second free-viewing fixations.
One of the earliest saliency models, proposed by Itti et al. [24], generated the first wave of interest across multiple disciplines including cognitive psychology, neuroscience, and computer vision. This model is an implementation of earlier general computational frameworks and psychological theories of bottom-up attention based on center-surround mechanisms (e.g., the feature integration theory of Treisman and Gelade [50], the guided search model of Wolfe et al. [51], and the computational attention architecture of Koch and Ullman [52]). In Ref. [24], Itti et al. show some examples where their model is able to detect spatial discontinuities in scenes. Subsequent behavioral (e.g., Ref. [53]) and computational (e.g., Ref. [54]) investigations used fixations as a means to verify the saliency hypothesis and to compare models.
Fig. 2 Sample results produced by different models. Left to right: input image, salient object detection [27], fixation prediction [24], image segmentation (regions of various sizes) [28], image segmentation (superpixels of comparable sizes) [29], and object proposals (true positives) [30].
A second wave of interest surged with the works of Liu et al. [25, 55] and Achanta et al. [56], who defined saliency detection as a binary segmentation problem. These authors were inspired by earlier models striving to detect salient regions or proto-objects (e.g., Ma and Zhang [57], Liu and Gleicher [58], and Walther and Koch [59]). A plethora of saliency models has emerged since then. It has been, however, less clear how this new definition relates to other established computer vision areas such as image segmentation (e.g., Refs. [60, 61]), category-independent object proposal generation (e.g., Refs. [30, 34, 62]), fixation prediction (e.g., Refs. [54, 63-66]), and object detection (e.g., Refs. [67, 68]).
A third wave of interest has appeared recently with the surge in popularity of convolutional neural networks (CNNs) [69], and in particular with the introduction of fully convolutional neural networks [70]. Unlike the majority of classic methods based on contrast cues [1], CNN-based methods both eliminate the need for hand-crafted features and alleviate the dependency on center-bias knowledge, and hence have been adopted by many researchers. A CNN-based model normally contains hundreds of thousands of tunable parameters and neurons with variable receptive field sizes. Neurons with large receptive fields provide global information that can help better identify the most salient region in an image, while neurons with small receptive fields provide local information that can be leveraged to refine the saliency maps produced by higher layers. This allows salient regions to be highlighted and their boundaries refined. These desirable properties enable CNN-based models to achieve unprecedented performance compared to hand-crafted feature-based models, and CNN models are gradually becoming the mainstream direction in salient object detection.
In this section, we review related works in three categories: 1) salient object detection models, 2) applications, and 3) datasets. The similarity of various models means that it is sometimes hard to draw sharp boundaries between them. Here we mainly focus on the models contributing to the major waves in the chronicle shown in Fig. 3.
A large number of approaches have been proposed for detecting salient objects in images over the past two decades. Except for a few models which attempt to segment objects-of-interest directly (e.g., Refs. [71-73]), most approaches aim to identify salient subsets of the image first (i.e., compute a saliency map) and then integrate them to segment the entire salient object.
Visual subsets may be pixels, blocks, superpixels, or regions. Blocks are rectangular patches uniformly sampled from the image; pixels are 1×1 blocks. A superpixel or a region is a perceptually homogeneous image patch that is confined within intensity edges. Superpixels in the same image often have comparable but different sizes, while the shapes and sizes of regions may vary considerably. In this review, the term block is used to cover pixels and patches, while superpixel and region are used interchangeably.
In general, classic approaches can be categorized in two different ways, depending on the type of operation or the attributes they exploit.
Fig. 3 A simplified chronicle of salient object detection modeling. The first wave started with the Itti et al. model [24], followed by the second wave with the introduction of the approach of Liu et al. [25] who were the first to define saliency as a binary segmentation problem. The third wave started with the surge of deep learning models and the model of Li and Yu [47].
1. Block-based versus region-based analysis. Two types of visual subsets have been utilized to detect salient objects: blocks and regions. Blocks were primarily adopted by early approaches, while regions became popular with the introduction of superpixel algorithms.
2. Intrinsic cues versus extrinsic cues. A key step in detecting salient objects is to distinguish them from distractors. To do so, some approaches extract various cues only from the input image itself, to highlight targets and to suppress distractors (i.e., the intrinsic cues). Other approaches argue that intrinsic cues are often insufficient to distinguish targets from distractors, especially when the two share common visual attributes. To overcome this issue, they incorporate extrinsic cues such as user annotation, depth maps, or statistical information about similar images to facilitate detection of salient objects in the image.
Using the above model categorization, four combinations are thus possible. To structure our review, we group the models into three major subgroups: 1) block-based models with intrinsic cues, 2) region-based models with intrinsic cues, and 3) models with extrinsic cues (both block- and region-based). Some approaches that do not easily fit into these subgroups are discussed in an "other classic models" subgroup. Reviewed models are listed in Table 1 (intrinsic models), Table 2 (extrinsic models), and Table 3 (other classic models).
2.1.1 Block-based models with intrinsic cues
In this subsection, we mainly review salient object detection models which utilize intrinsic cues extracted from blocks. Following the seminal work of Itti et al. [24], salient object detection is widely defined as capturing uniqueness, distinctiveness, or rarity in a scene.
Table 1 Salient object detection models with intrinsic cues (sorted by year). Elements: {PI = pixel, PA = patch, RE = region}, where prefixes m and h indicate multi-scale and hierarchical versions, respectively. Hypothesis: {CP = center prior, G = global contrast, L = local contrast, D = edge density, B = background prior, F = focus prior, O = objectness prior, CV = convexity prior, CS = center-surround contrast, CLP = color prior, SD = spatial distribution, BC = boundary connectivity prior, SPS = sparse noise}. Aggregation/optimization: {LN = linear, NL = non-linear, AD = adaptive, HI = hierarchical, BA = Bayesian, GMRF = Gaussian MRF, EM = energy minimization, LS = least-square solver}. Code: {M = Matlab, C = C/C++, NA = not available, EXE = executable}
Table 2 Salient object detection models with extrinsic cues, grouped by their adopted cues. Cues: {GT = ground-truth annotation, SI = similar images, TC = temporal cues, SCO = saliency co-occurrence, DP = depth, LF = light field}. Saliency hypothesis: {P = generic properties, PRA = pre-attention cues, HD = discriminativity in high-dimensional feature space, SS = saliency similarity, CMP = complement of saliency cues, SP = sampling probability, MCO = motion coherence, RP = repeatedness, RS = region similarity, C = correspondence, DK = domain knowledge}. Others: {CRF = conditional random field, SVM = support vector machine, BDT = boosted decision tree, RF = random forest}
Table 3 Other salient object detection models
In early works [56-58], uniqueness was often computed as the pixel-wise center-surround contrast. Hu et al. [74] represent the input image in a 2D space using the polar transformation of its features. Each region in the image is then mapped into a 1D linear subspace. Afterwards, generalized principal component analysis (GPCA) [75] is used to estimate the linear subspaces without actually segmenting the image. Finally, salient regions are selected by measuring the feature contrast and geometric properties of regions. Rosin [76] proposes an efficient approach for detecting salient objects. His approach is parameter-free and requires only very simple pixel-wise operations such as edge detection, threshold decomposition, and moment-preserving binarization. Valenti et al. [77] propose an isophote-based framework where the saliency map is estimated by linearly combining saliency maps computed in terms of curvedness, color boosting, and isocenter clustering.
In an influential study, Achanta et al. [37] adopt a frequency-tuned approach to compute full resolution saliency maps. The saliency of pixel $x$ is computed as
$$s(x) = \|I_\mu - I_{\omega_{hc}}(x)\| \tag{1}$$
where $I_\mu$ is the mean pixel value of the image (e.g., RGB/Lab features) and $I_{\omega_{hc}}$ is a Gaussian blurred version of the input image (e.g., using a 5×5 kernel).
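To make Eq. (1) concrete, here is a minimal Python sketch of frequency-tuned saliency; the use of OpenCV, the Lab color space, and the function name are our illustrative choices rather than the authors' code.

```python
import cv2
import numpy as np

def frequency_tuned_saliency(bgr):
    """Eq. (1): s(x) = ||I_mu - I_whc(x)||, computed in Lab space."""
    lab = cv2.cvtColor(bgr, cv2.COLOR_BGR2LAB).astype(np.float64)
    mean_lab = lab.reshape(-1, 3).mean(axis=0)        # I_mu: mean image feature
    blurred = cv2.GaussianBlur(lab, (5, 5), 0)        # I_whc: blurred input
    sal = np.linalg.norm(blurred - mean_lab, axis=2)  # per-pixel distance
    return sal / (sal.max() + 1e-12)                  # normalize to [0, 1]

sal = frequency_tuned_saliency(cv2.imread("input.jpg"))  # "input.jpg" is a placeholder
```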
Without prior knowledge of the sizes of salient objects, multi-scale contrast is frequently adopted for robustness [25, 58]. An $L$-layer Gaussian pyramid is first constructed (as in Refs. [25, 58]). The saliency score of pixel $x$, aggregated over the image $I^{(l)}$ at each level $l$ of this pyramid, is defined as
$$s(x) = \sum_{l=1}^{L} \sum_{x' \in N(x)} \|I^{(l)}(x) - I^{(l)}(x')\|^2 \tag{2}$$
where $N(x)$ is a neighborhood window centered at $x$ (e.g., 9×9 pixels). Even with such multi-scale enhancement, intrinsic cues derived at pixel level are often too poor to support object segmentation. To address this, some works (e.g., Refs. [25, 56, 78, 79]) extended contrast analysis to the patch level (comparing patches to their neighbors).
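A naive implementation of Eq. (2) costs one distance computation per neighbor per pixel; the sketch below uses the box-filter identity Σ‖I(x)−I(x′)‖² = k‖I(x)‖² − 2 I(x)·S₁(x) + S₂(x) to make it linear in image size. The pyramid depth and window size are illustrative assumptions.

```python
import cv2
import numpy as np

def multiscale_contrast(bgr, levels=3, win=9):
    """Eq. (2): sum over pyramid levels of local squared color differences."""
    img = bgr.astype(np.float64) / 255.0
    h, w = img.shape[:2]
    total = np.zeros((h, w))
    for _ in range(levels):
        # S1 = box-filtered I, S2 = box-filtered ||I||^2 (both unnormalized).
        s1 = cv2.boxFilter(img, -1, (win, win), normalize=False)
        s2 = cv2.boxFilter((img ** 2).sum(axis=2), -1, (win, win), normalize=False)
        k = win * win
        contrast = k * (img ** 2).sum(axis=2) - 2 * (img * s1).sum(axis=2) + s2
        total += cv2.resize(contrast, (w, h))  # accumulate at full resolution
        img = cv2.pyrDown(img)                 # next (coarser) pyramid level
    return total / (total.max() + 1e-12)
```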
Later, in Ref. [78], Klein and Frintrop proposed an information-theoretic approach that computes center-surround contrast using the Kullback-Leibler divergence between distributions of features such as intensity, color, and orientation. Li et al. [79] formulated center-surround contrast as a cost-sensitive max-margin classification problem. The center patch is labeled as a positive sample while the surrounding patches are all used as negative samples. The saliency of the center patch is then determined by its separability from the surrounding patches, based on a trained cost-sensitive support vector machine (SVM).
Some works have defined patch uniqueness as a patch's global contrast to other patches [39]. Intuitively, a patch is considered to be salient if it is significantly different from the other patches most similar to it; their spatial distances are taken into account. Similarly, Borji and Itti computed local and global patch rarity in RGB and Lab color spaces and fused them to predict fixation locations [65]. In recent work [80], Margolin et al. define the uniqueness of a patch by measuring its distance to the average patch, based on the observation that distinctive patches are more scattered than non-distinctive ones in the high-dimensional patch space. To further incorporate the patch distribution, the uniqueness of a patch is measured by projecting its path to the average patch onto the principal components of the image.
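The following is a hedged sketch of such PCA-based patch distinctness; scoring with the L1 norm of the PCA coordinates follows the description above, while the grayscale simplification, patch size, and plain SVD implementation are illustrative assumptions.

```python
import numpy as np

def patch_distinctness(gray, patch=9):
    """Distinctness of each patch as the L1 norm of its PCA coordinates."""
    h, w = gray.shape
    ps = np.lib.stride_tricks.sliding_window_view(gray, (patch, patch))
    ps = ps.reshape(-1, patch * patch).astype(np.float64)
    centered = ps - ps.mean(axis=0)          # "path" to the average patch
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    coords = centered @ vt.T                 # project onto principal components
    dist = np.abs(coords).sum(axis=1)        # L1 norm = distinctness
    return dist.reshape(h - patch + 1, w - patch + 1)
```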
To sum up, the approaches in this section aim to detect salient objects based on pixels or patches, utilizing only intrinsic cues. These approaches usually suffer from two shortcomings: 1) high-contrast edges usually stand out instead of the salient object, and 2) the boundary of the salient object is not preserved well (especially when using large blocks). To overcome these issues, some methods propose to compute saliency based on regions. This offers two main advantages. First, the number of regions is far smaller than the number of blocks, offering the potential to develop highly efficient and fast algorithms. Second, more informative features can be extracted from regions, leading to better performance. Such region-based approaches are discussed in the next subsection.
2.1.2 Region-based models with intrinsic cues
Saliency models in the second subgroup adopt intrinsic cues extracted from image regions generated using methods such as graph-based segmentation [81], mean-shift [28], SLIC [29], or Turbopixels [82]. Unlike block-based models, region-based models typically first segment an input image into regions aligned with intensity edges, and then compute a regional saliency map.
As an early attempt, in Ref. [58], the regional saliency score is defined as the average saliency score of the region's pixels, defined in terms of multi-scale contrast. Yu and Wong [83] propose a set of rules to determine the background scores of each region based on observations from background and salient regions. Saliency, defined as uniqueness in terms of global regional contrast, is widely studied in many approaches [42, 84-87]. In Ref. [84], a region-based saliency algorithm is introduced by measuring the global contrast between the target region and all other image regions. In a nutshell, an image is first segmented into $N$ regions $\{r_i\}_{i=1}^{N}$. The saliency of region $r_i$ is measured as
$$s(r_i) = \sum_{j \neq i} w_{ij} D_r(r_i, r_j) \tag{3}$$
where $D_r(r_i, r_j)$ captures the appearance contrast between two regions and $w_{ij}$ is a weight linking regions $r_i$ and $r_j$, which incorporates spatial distance and region size. Higher saliency scores are assigned to regions with large global contrast. Perazzi et al. [27] demonstrate that if $D_r(r_i, r_j)$ is defined as the Euclidean color distance between $r_i$ and $r_j$, global contrast can be computed using efficient filtering based techniques [88].
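Given precomputed region statistics, Eq. (3) reduces to a few vectorized operations. In the sketch below, the Gaussian fall-off with spatial distance and the weighting by region size are one common instantiation of $w_{ij}$ (cf. Refs. [27, 84]); the parameter value is an illustrative assumption.

```python
import numpy as np

def global_region_contrast(colors, centers, sizes, sigma=0.4):
    """Eq. (3). colors: (N,3) mean Lab colors; centers: (N,2) normalized
    centroids; sizes: (N,) region areas. Returns one score per region."""
    color_dist = np.linalg.norm(colors[:, None] - colors[None], axis=2)  # D_r
    spatial = np.linalg.norm(centers[:, None] - centers[None], axis=2)
    w = np.exp(-spatial ** 2 / (2 * sigma ** 2)) * sizes[None]           # w_ij
    np.fill_diagonal(w, 0)                    # exclude j = i from the sum
    sal = (w * color_dist).sum(axis=1)
    return sal / (sal.max() + 1e-12)
```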
In addition to color uniqueness, the distinctiveness of complementary cues such as texture [85] and structure [89] is also considered for salient object detection. Margolin et al. [80] propose to combine regional uniqueness and patch distinctiveness to form a saliency map. Instead of maintaining a hard region index for each pixel, a soft abstraction is proposed in Ref. [86] to generate a set of large-scale perceptually homogeneous regions using histogram quantization and Gaussian mixture models (GMMs). By avoiding hard decisions about the boundaries of superpixels, such soft abstraction provides large spatial support, which results in more uniformly highlighted salient regions.
In Ref. [93], Jiang et al. propose a multi-scale local region contrast based approach, which calculates saliency values across multiple segmentations for robustness and combines these regional saliency values into a pixel-wise saliency map. A similar idea of estimating regional saliency using multiple hierarchical segmentations is adopted in Refs. [42, 98]. Li et al. [79] extend pairwise local contrast by building a hypergraph, constructed by non-parametric multi-scale clustering of superpixels, to capture both the internal consistency and external separation of regions. Salient object detection is then cast as finding salient vertices and hyperedges in the hypergraph.
Salient objects, in terms of uniqueness, can also be defined as sparse noise in a certain feature space in which the input image is represented as a low-rank matrix [94, 102, 103]. The basic assumption is that non-salient regions (i.e., the background) can be explained by the low-rank matrix, while the salient regions are indicated by the sparse noise.
Based on such a general low-rank matrix recovery framework, Shen and Wu [94] propose a unified approach to incorporate traditional low-level features with higher-level guidance, e.g., the center, face, and color priors, to detect salient objects based on a learned feature transformation. (Although extrinsic ground-truth annotations are adopted to learn the high-level priors and the feature transformation, we classify this model with the intrinsic models to better organize the low-rank matrix recovery based approaches; additionally, we treat the face and color priors as universal intrinsic cues for salient object detection.) Instead, Zou et al. [102] propose to exploit bottom-up segmentation as a guidance cue for low-rank matrix recovery, for robustness. Similar to Ref. [94], high-level priors are also adopted in Ref. [103], where a tree-structured sparsity-inducing norm regularization is introduced to hierarchically describe the image structure, in order to uniformly highlight the entire salient object.
In addition to capturing uniqueness, more and more priors have been proposed for salient object detection. The spatial distribution prior [25] implies that the more widely a color is distributed in the image, the less likely a salient object is to contain this color. The spatial distribution of superpixels can be efficiently evaluated in linear time using a Gaussian blurring kernel, in a similar way to computing global regional contrast in Eq. (3). Such a spatial distribution prior is also considered in Ref. [89], where it is evaluated in terms of both color and structural cues.
A center prior assumes that a salient object is more likely to be found near the image center, and that the background tends to be far away from it. Based on this assumption, the backgroundness prior is adopted for salient object detection in Refs. [95, 97-99]: a narrow border of the image is taken to form the background region, i.e., the pseudo-background. With this pseudo-background as a reference, regional saliency can be computed as the contrast of regions versus the "background". In Ref. [97], a two-stage saliency computation framework is proposed based on manifold ranking on an undirected weighted graph. In the first stage, regional saliency scores are computed based on the relevance given to each side of the pseudo-background queries. In the second stage, the saliency scores are refined based on the relevance given to the initial foreground. In Ref. [98], saliency computation is formulated in terms of dense and sparse reconstruction errors with respect to the pseudo-background. The dense reconstruction error of each region is computed from a principal component analysis (PCA) of the background templates, while the sparse reconstruction error is defined as the residual after sparse representation over the background templates. These two types of reconstruction errors are propagated to pixels in multiple segmentations, which are fused to form the final saliency map. Jiang et al. [99] formulate saliency detection via absorbing Markov chains, in which the transient and absorbing nodes are superpixels around the image center and border, respectively. The saliency of each superpixel is computed as the absorption time from that transient node to the absorbing nodes of the Markov chain.
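For intuition, the manifold ranking of Ref. [97] has the closed form $\mathbf{f}^* = (\mathbf{D} - \alpha\mathbf{W})^{-1}\mathbf{y}$, where $\mathbf{W}$ is the affinity matrix of the superpixel graph, $\mathbf{D}$ its degree matrix, and $\mathbf{y}$ the query indicator. The sketch below implements a single first-stage pass with boundary queries; the graph construction and parameter values are simplified assumptions (the full method ranks against each image border separately and adds a foreground-query stage).

```python
import numpy as np

def manifold_ranking_saliency(colors, adjacency, border_ids, alpha=0.99, sigma=0.1):
    """colors: (N,3) mean region colors; adjacency: (N,N) boolean graph;
    border_ids: indices of superpixels on the image border (the queries)."""
    n = len(colors)
    dist = np.linalg.norm(colors[:, None] - colors[None], axis=2)
    w = np.exp(-dist / sigma) * adjacency      # edge weights on the graph
    d = np.diag(w.sum(axis=1))
    ranking = np.linalg.inv(d - alpha * w)     # closed-form ranking matrix
    y = np.zeros(n)
    y[border_ids] = 1.0                        # pseudo-background queries
    relevance = ranking @ y                    # relevance to the background
    return 1.0 - relevance / (relevance.max() + 1e-12)
```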
Beyond these approaches, the generic objectness prior is also used to facilitate salient object detection, by leveraging object proposals [34]. Although it is learned from training data, we again treat it as a universal intrinsic cue for salient object detection. Chang et al. [92] present a computational framework that fuses objectness and regional saliency into a graphical model; the two terms are jointly estimated by iteratively minimizing an energy function that encodes their mutual interaction. In Ref. [100], region objectness is defined as the average objectness value of the pixels within the region, and is incorporated into the regional saliency computation. Jia and Han [101] compute the saliency of each region by comparing it to the "soft" foreground and background obtained from the objectness prior.
Salient object detection relying on the pseudo-background assumption may sometimes fail, especially when the object touches the image border. To overcome this problem, a boundary connectivity prior is utilized in Refs. [84, 105]. Intuitively, salient objects are much less connected to the image border than background objects are. Thus, the boundary connectivity score of a region can be estimated according to the ratio of its length along the image border to the spanning area of this region [105], where the spanning area is computed from the region's geodesic distances to the pseudo-background and other regions. Such a boundary connectivity score is integrated into a quadratic objective function to obtain the final optimized saliency map. It is worth noting that similar ideas to the boundary connectivity prior are also investigated as a segmentation prior in Ref. [102] and as surroundingness in Ref. [106].
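A hedged sketch of this score, in the spirit of Ref. [105]: with geodesic color distances $d_{\mathrm{geo}}$ on the superpixel graph, soft areas are accumulated as $\exp(-d_{\mathrm{geo}}^2/2\sigma^2)$ and the score is $\mathrm{BndCon}(r) = \mathrm{Len}_{\mathrm{bnd}}(r)/\sqrt{\mathrm{Area}(r)}$; the σ value and graph details are illustrative assumptions.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import shortest_path

def boundary_connectivity(colors, adjacency, on_border, sigma=0.25):
    """colors: (N,3); adjacency: (N,N) bool; on_border: (N,) bool mask."""
    edge_cost = np.linalg.norm(colors[:, None] - colors[None], axis=2)
    graph = csr_matrix(np.where(adjacency, edge_cost + 1e-9, 0.0))
    d_geo = shortest_path(graph, method="D", directed=False)  # geodesic distances
    affinity = np.exp(-d_geo ** 2 / (2 * sigma ** 2))
    area = affinity.sum(axis=1)                   # soft spanning area
    len_bnd = affinity[:, on_border].sum(axis=1)  # soft length along the border
    return len_bnd / np.sqrt(area)                # high for background regions
```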
The focus prior, i.e., the fact that a salient object is often photographed in focus to attract attention, has been investigated in Refs. [100, 107]. Jiang et al. [100] calculate a focus score from the degree of focal blur. By modeling defocusing as the convolution of a sharp image with a point spread function approximated by a Gaussian kernel, the pixel-level degree of focus can be estimated as the standard deviation of the Gaussian kernel via scale space analysis. A regional focus score is computed by propagating the focus score and/or sharpness at the boundary and interior edge pixels. The saliency score is finally derived from a non-linear combination of uniqueness (global contrast), objectness, and focus scores.
Performance of salient object detection based on regions can be affected by the choice of segmentation parameters. In addition to other approaches based on multi-scale regions [42, 79, 93], single-scale potential salient regions are extracted by solving the facility location problem in Ref. [87]. An input image is first represented as an undirected graph of superpixels, from which a much smaller set of candidate region centers is generated through agglomerative clustering. On this set, a submodular objective function is built to maximize the similarity. By applying a greedy algorithm, the objective function can be iteratively optimized to group superpixels into regions, whose saliency values are further measured via regional global contrast and spatial distribution.
The Bayesian framework can also be exploited for saliency computation [96, 108], formulated as estimating the posterior probability of pixel $x$ being foreground given the input image $I$. To estimate the saliency prior, a convex hull $H$ is first estimated around the detected points of interest. The convex hull $H$, which divides the image $I$ into the inner region $R_I$ and outer region $R_O$, provides a coarse estimation of the foreground as well as the background, and can be adopted for likelihood computation. Liu et al. [104] use an optimization-based framework for detecting salient objects. As in Ref. [96], a convex hull is roughly estimated to partition an image into pure background and potential foreground. Then, saliency seeds are learned from the image, while a guidance map is learned from background regions as well as human prior knowledge. Using these cues, a general linear elliptic system with Dirichlet boundary conditions is introduced to model diffusion from seeds to other regions, generating a saliency map.
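As a minimal sketch of this Bayesian formulation, assume a precomputed convex-hull mask over interest points; color histograms inside and outside the hull serve as the likelihoods. The bin count and the scalar area-ratio prior are illustrative simplifications, not the exact choices of Refs. [96, 108].

```python
import numpy as np

def bayesian_saliency(img, hull_mask, bins=8):
    """img: (H,W,3) uint8; hull_mask: (H,W) bool convex hull over interest points."""
    q = (img.astype(np.int64) // (256 // bins)).reshape(-1, 3)
    idx = q[:, 0] * bins * bins + q[:, 1] * bins + q[:, 2]   # quantized color index
    inner, outer = idx[hull_mask.ravel()], idx[~hull_mask.ravel()]
    p_fg = np.bincount(inner, minlength=bins ** 3) / max(len(inner), 1)
    p_bg = np.bincount(outer, minlength=bins ** 3) / max(len(outer), 1)
    prior = hull_mask.mean()                                  # P(foreground)
    post = prior * p_fg[idx] / (prior * p_fg[idx] + (1 - prior) * p_bg[idx] + 1e-12)
    return post.reshape(hull_mask.shape)                      # P(foreground | x)
```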
Among the models reviewed in this subsection, three main types of region are adopted for saliency computation. Irregular regions of varying sizes can be generated using a graph-based segmentation algorithm [81], the mean-shift algorithm [28], or clustering (quantization). On the other hand, with recent progress in superpixel algorithms, compact regions of comparable sizes are also popular choices, generated using the SLIC algorithm [29], the Turbopixel algorithm [82], etc. The main difference between these two types of regions is whether the influence of region size should be taken into account. Furthermore, soft regions can also be considered for saliency analysis, where each pixel maintains a probability of belonging to each region (component) instead of having a hard region label (e.g., fitted by a GMM). To further enhance the robustness of segmentation, regions can be generated from multiple segmentations or in a hierarchical way. Generally, single-scale segmentation is faster, while multi-scale segmentation can improve the overall quality of the results.
To measure the saliency of regions, uniqueness, usually in the form of global and local regional contrast, is still the most frequently used feature. In addition, more and more complementary priors for regional saliency have been investigated to improve the overall results, such as backgroundness, objectness, focus, and boundary connectivity. Compared to block-based saliency models, the incorporation of these priors is the main advantage of region-based saliency models. Furthermore, regions provide more sophisticated cues (e.g., color histograms) with which to better capture the salient object in a scene, in contrast to pixels and patches. Another benefit of defining saliency using regions relates to efficiency: since the number of regions in an image is far smaller than the number of pixels, computing saliency at region level can significantly reduce the computational cost while still producing full-resolution saliency maps.
Notice that the approaches discussed in this subsection only utilize intrinsic cues. In the next subsection, we review how to incorporate extrinsic cues to facilitate the detection of salient objects.
2.1.3 Models with extrinsic cues
Models in the third subgroup adopt extrinsic cues to assist in the detection of salient objects in images and videos. In addition to the visual cues observed in the single input image, extrinsic cues can be derived from the ground-truth annotations of training images, similar images, video sequences, a set of input images containing common salient objects, depth maps, or light field images. In this section, we review such models according to the type of extrinsic cues used. Table 2 lists all models with extrinsic cues; each method is characterized by several predefined attributes.
Salient object detection with similar images. With the availability of an increasingly large amount of visual content on the web, salient object detection by leveraging images visually similar to the input image has been studied in recent years. Generally, given the input image $I$, $K$ similar images $\{I_k\}_{k=1}^{K}$ are first retrieved from a large collection of images $\mathcal{C}$. Salient object detection in the input $I$ can then be assisted by examining these similar images.
In some studies, it is assumed that saliency annotations of $\mathcal{C}$ are available. For example, Marchesotti et al. [113] propose to describe each indexed image $I_k$ by a pair of descriptors, which respectively denote the feature descriptors (Fisher vectors) of its salient and non-salient regions according to the saliency annotations. To compute the saliency map, each patch $p_x$ of the input image is described by a Fisher vector $f_x$, and the saliency of each patch is then computed according to the contrast of $f_x$ with the foreground and background descriptors of the retrieved images.
Alternatively, based on the observation that different features contribute differently to the saliency analysis of each image, Mai et al. [115] propose to learn image-specific rather than universal weights to fuse the saliency maps computed on different feature channels. To this end, the CRF aggregation model of saliency maps is trained only on retrieved similar images, to account for the dependence of aggregation on individual images. We give further technical details of Ref. [115] in Section 2.1.4.
Saliency based on similar images works well if large-scale image collections are available. Saliency annotation, however, is time consuming, tedious, and even intractable on such collections. To mitigate this, some methods leverage unannotated similar images. Using web-scale image collections $\mathcal{C}$, Wang et al. [114] propose a simple yet effective saliency estimation algorithm, in which the pixel-wise saliency map is computed by contrasting the input image with the retrieved similar images.
Siva et al. [35] propose a probabilistic formulation that casts saliency computation as a sampling problem. A patch $p_x$ is considered to be salient if it has a low probability of being sampled from the images $\mathcal{C}_I \cup I$. In other words, a high saliency score will be given to $p_x$ if it is unusual among a bag of patches extracted from similar images.
Co-saliency object detection. Instead of concentrating on computing saliency in a single image, co-salient object detection algorithms focus on discovering common salient objects shared by multiple input images. Such objects can be the same object seen from different viewpoints, or objects of the same category sharing similar visual appearance. Note that the key characteristic of co-salient object detection algorithms is that their input is a set of images, while classical salient object detection models only need a single input image.
Co-saliency detection is closely related to the concept of image co-segmentation, which aims to segment similar objects from multiple images [124, 125]. As stated in Ref. [121], three major differences exist between co-saliency and co-segmentation. First, co-saliency detection algorithms only focus on detecting common salient objects, while similar but non-salient background might also be segmented out by co-segmentation approaches [126, 127]. Second, some co-segmentation methods, e.g., Ref. [125], need user input to guide the segmentation process in ambiguous situations. Third, salient object detection often serves as a pre-processing step, so more efficient algorithms are required than for co-segmentation, especially when processing a large number of images.
Li and Ngan [119] propose a method to compute co-saliency for an image pair with some objects in common. Co-saliency is defined in terms of inter-image correspondence, i.e., low saliency values should be given to dissimilar regions. Similarly, in Ref. [120], Chang et al. propose to compute co-saliency by exploiting the additional repeatedness property across multiple images. Specifically, the co-saliency score of a pixel is defined as the product of its traditional saliency score [39] and its repeatedness likelihood over the input images. Fu et al. [121] propose a cluster-based co-saliency detection algorithm by exploiting the well-established global contrast and spatial distribution concepts on a single image. Additionally, correspondence cues over multiple images are introduced to account for saliency co-occurrence.
2.1.4 Other classic models
In this section, we review algorithms that aim to directly segment or localize salient objects with bounding boxes, and algorithms that are closely related to saliency detection. Some subsections offer a different categorization of some models covered in the previous sections (e.g., supervised versus unsupervised). See Table 3.
Localization models. Liu et al. [25] convert the binary segmentation map to bounding boxes. The final output is a set of rectangles around salient objects. Feng et al. [128] define saliency for a sliding window as its composition cost using the remaining image parts. Based on an over-segmentation of the image, the local maxima, which can efficiently be found among all sliding windows in a brute-force manner, are assumed to correspond to salient objects.
The basic assumption in many previous approaches is that at least one salient object exists in the input image. This may not always hold, as some background images contain no salient objects at all. In Ref. [129], Wang et al. investigate the problem of localizing salient objects in thumbnail images and predicting their existence. Specifically, each image is described by a set of features extracted in multiple channels. The existence of salient objects is formulated as a binary classification problem. For localization, a regression function is learned using random forest regression on training samples to directly output the position of the salient object.
Segmentation models. Segmenting salient objects is closely related to the figure-ground problem, which is essentially a binary classification problem: separating the salient object from the background. Yu et al. [90] utilize the complementary characteristics of imperfect saliency maps generated by different contrast-based saliency models. Specifically, two complementary saliency maps are first generated for each image: a sketch-like map and an envelope-like map. The sketch-like map can accurately locate parts of the most salient object (i.e., a skeleton, with high precision), while the envelope-like map can roughly cover the entire salient object (i.e., an envelope, with high recall). With these two maps, reliable foreground and background regions can be detected in each image and used to train a pixel classifier. By labeling all other pixels with this classifier, the salient object can be detected as a whole. This method is extended in Ref. [131] by learning complementary saliency maps for the purpose of salient object segmentation.
Lu et al. [91] exploit the convexity (concavity) prior for salient object segmentation. This prior assumes that the region on the convex side of a curved boundary tends to belong to the foreground. Based on this assumption, concave arcs are first found on the contours of superpixels. The convexity context of a concave arc is defined by windows close to the arc. An undirected weighted graph is then built over superpixels with concave arcs, where the weights between vertices are determined by summing the concavity context at different scales in the hierarchical segmentation of the image. Finally, the normalized cut algorithm [134] is used to separate the salient object from the background.
To leverage contextual cues more effectively, Wang et al. [130] propose to integrate an auto-context classifier [135] into an iterative energy minimization framework to automatically segment the salient object. The auto-context model is a multi-layer boosting classifier on each pixel and its surroundings to predict whether it is associated with the target concept. The subsequent layer is built on the classification of the previous layer. Hence, through the layered learning process, spatial context is automatically utilized for more accurate segmentation of the salient object.
Supervised versus unsupervised models. The majority of existing learning-based works on saliency detection focus on the supervised scenario, i.e., learning a salient object detector from a set of training samples with ground-truth annotation. The aim here is to separate salient elements from background elements.
Each element (e.g., a pixel or a region) in the input image is represented by a feature vector $\mathbf{f} \in \mathbb{R}^D$, where $D$ is the feature dimension. Such a feature vector is then mapped to a saliency score $s \in \mathbb{R}_+$ by a learned linear or non-linear mapping $\mathbb{R}^D \to \mathbb{R}_+$.
One can assume the mapping function is linear, i.e., $s = \mathbf{w}^\top \mathbf{f}$, where $\mathbf{w}$ denotes the combination weights of all components in the feature vector. Liu et al. [25] learn the weights with a conditional random field (CRF) model trained on rectangular annotations of the salient objects. In recent work [111], the large-margin framework is adopted to learn the weights $\mathbf{w}$.
Due to the highly non-linear nature of the saliency mechanism, however, a linear mapping may not perfectly capture the characteristics of saliency. To this end, the linear approach is extended in Ref. [109], where a mixture of linear support vector machines (SVMs) is adopted to partition the feature space into a set of sub-regions that are linearly separable, using a divide-and-conquer strategy. In each region, a linear SVM, its mixture weights, and the combination parameters of the saliency features are learned for better saliency estimation. Alternatively, other non-linear classifiers such as boosted decision trees (BDTs) [110, 112] and random forests (RFs) [40] may also be utilized.
Generally speaking, supervised approaches allow richer representations of the elements compared with heuristic methods. In seminal work on supervised salient object detection, Liu et al. [25] propose a set of features including local multi-scale contrast, regional center-surround histogram distance, and global color spatial distribution. As with models using only intrinsic cues, region-based representations for salient object detection have become increasingly popular, as more sophisticated descriptors can be extracted at region level. Mehrani and Veksler [110] demonstrate promising results by considering generic regional properties, e.g., color and shape, which are widely used in other applications like image classification. Jiang et al. [40] propose a regional saliency descriptor including regional local contrast, regional backgroundness, and regional generic properties. In Refs. [111, 112], each region is described by a set of features such as local and global contrast, backgroundness, spatial distribution, and the center prior. Pre-attentive features are also considered in Ref. [111].
Usually, richer representations result in feature vectors with higher dimensions, e.g., D = 93 in Ref. [40] and D = 75 in Ref. [112]. With the availability of large collections of training samples, the learned classifier is capable of automatically integrating such rich features and selecting the most discriminative ones. Therefore, better performance can be expected than with heuristic methods.
Some models have utilized unsupervised techniques. In Ref. [35], saliency computation is formulated in a probabilistic framework as a sampling problem. The saliency of each image patch is proportional to its sampling probability from all patches extracted from both the input image and similar images retrieved from a corpus of unlabeled images. In Ref. [136], cellular automata are exploited for unsupervised salient object detection.
Aggregation and optimization models. Given M saliency maps, coming from different salient object detection models or hierarchical segmentations of the input image, aggregation models try to form a more accurate saliency map. Let $S_i(x)$ denote the saliency value of pixel $x$ in the $i$th saliency map. In Ref. [132], Borji et al. propose a standard saliency aggregation method as follows:
$$P(s_x = 1 \mid \mathbf{f}_x) = \zeta\Big(\sum_{i=1}^{M} \lambda_i S_i(x)\Big)$$
where $\mathbf{f}_x = (S_1(x), \cdots, S_M(x))$ are the saliency scores for pixel $x$, $s_x = 1$ indicates that $x$ is labeled as salient, and the $\lambda_i$ are combination weights. $\zeta(\cdot)$ is a real-valued function which takes the logistic form $\zeta(z) = 1/(1 + e^{-z})$.
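In practice this amounts to a logistic model over the M per-pixel saliency values; the sketch below fits it with scikit-learn on annotated training pixels and then fuses the maps of a new image (the use of scikit-learn is our assumption, not the authors' implementation).

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def fit_aggregator(maps, gt):
    """maps: list of M (H,W) saliency maps; gt: (H,W) binary ground truth."""
    X = np.stack([m.ravel() for m in maps], axis=1)   # f_x for every pixel
    return LogisticRegression().fit(X, gt.ravel() > 0.5)

def aggregate(clf, maps):
    X = np.stack([m.ravel() for m in maps], axis=1)
    return clf.predict_proba(X)[:, 1].reshape(maps[0].shape)  # P(s_x = 1 | f_x)
```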
Inspired by the aggregation model in Ref. [132], Mai et al. [115] propose two aggregation solutions. The first adopts pixel-wise aggregation, learning a model over the M saliency values of each pixel; the second models interactions between neighboring pixels with a CRF to encourage spatially consistent fusion.
Alternatively, Yan et al. [42] integrate saliency maps computed on hierarchical segmentations of the image into a tree-structured graphical model, where each node corresponds to a region at some level of the hierarchy. Thanks to the tree structure, saliency inference can be conducted efficiently using belief propagation. In fact, solving the three-layer hierarchical model is equivalent to applying a weighted average to all single-layer maps. Unlike naive multi-layer fusion, this hierarchical inference algorithm can select optimal weights for each region instead of a global weighting.
Li et al. [133] propose to optimize the saliency values of all superpixels in an image to simultaneously meet several saliency criteria including visual rarity, center-bias, and mutual correlation. Let $w_{ij}$ denote the correlation (similarity score) between two regions $r_i$ and $r_j$, let $d_{ij}$ and $d_i$ denote the spatial distances from $r_i$ to $r_j$ and to the image center respectively, and let $d_D$ be half the image diagonal length. The saliency values $\{s_i\}_{i=1}^{N}$ (denoting $s(r_i)$ as $s_i$ for short) are then optimized by quadratic programming, in which the saliency value of each superpixel is determined while considering the influence of all other superpixels. Zhu et al. [105] also adopt a similar optimization-based framework to integrate multiple foreground/background cues as well as smoothness terms to automatically infer optimal saliency values.
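As one concrete instance of this family, the objective of Ref. [105] combines a background term pulling scores toward 0, a foreground term pulling them toward 1, and a pairwise smoothness term; its minimizer is the solution of a linear system. The sketch below is a simplified rendering of that idea with dense matrices.

```python
import numpy as np

def optimize_saliency(w_bg, w_fg, w_smooth):
    """Minimize sum_i w_bg[i]*s_i^2 + sum_i w_fg[i]*(s_i-1)^2
    + sum_{i,j} w_smooth[i,j]*(s_i-s_j)^2 (over ordered pairs).
    w_bg, w_fg: (N,) unary weights; w_smooth: (N,N) symmetric weights."""
    laplacian = np.diag(w_smooth.sum(axis=1)) - w_smooth
    a = np.diag(w_bg + w_fg) + 2 * laplacian   # from setting the gradient to zero
    return np.linalg.solve(a, w_fg)            # optimal saliency scores
```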
The Bayesian framework is also adopted to more effectively integrate the complementary dense and sparse reconstruction errors [98]. A fully-connected Gaussian Markov random field between each pair of regions is constructed to enforce consistency between salient regions [101], permitting efficient computation of the final regional saliency scores.
Active models. Inspired by interactive segmentation models (e.g., Refs. [137, 138]), a new trend has emerged recently that explicitly decouples the two stages of saliency detection mentioned in Section 1.1: 1) detecting the most salient object and 2) segmenting it. Some studies propose to perform active segmentation by utilizing the advantages of both fixation prediction and segmentation models. For example, Mishra et al. [21] combine multiple cues (e.g., color, intensity, texture, stereo, and/or motion) to predict fixations. The "optimal" closed contour for the salient object around the fixation point is then segmented in polar space. Li et al. [22] propose a model composed of two components: a segmenter that proposes candidate regions and a selector that gives each region a saliency score (using a fixation prediction model). Similarly, Borji [23] proposes to first roughly locate the salient object at the peak of the fixation map (or its estimate from a fixation prediction model) and then segment the object using superpixels. The last two algorithms adopt annotations to determine the upper bound of segmentation performance, propose datasets with multiple objects per scene, and provide new insight into the inherent connections between fixation prediction and salient object segmentation.
Salient object detection in video. In addition to spatial information, video sequences provide temporal cues, e.g., motion, which facilitate salient object detection. Zhai and Shah [116] first estimate keypoint correspondences between two consecutive frames. Motion contrast is computed based on planar motions (homographies) between images, estimated by applying RANSAC to the point correspondences. Liu et al. [117] extend their spatial saliency features [25] to the motion field resulting from an optical flow algorithm. Using the colorized motion field as the input image, local multi-scale contrast, regional center-surround distance, and global spatial distribution are computed and finally integrated in a linear way. Rahtu et al. [108] integrate spatial saliency into an energy minimization framework by considering a temporal coherence constraint. Li et al. [118] extend regional contrast-based saliency to the spatio-temporal domain. Given an over-segmentation of the frames of the video sequence, spatial and temporal region matches between each pair of consecutive frames are estimated in an iterative manner on an undirected unweighted matching graph, based on the regions' colors, textures, and motion features. The saliency of a region is determined by computing its local contrast to the surrounding regions, not only in the present frame but also in the temporal domain.
Salient object detection with depth. We live in a 3D environment in which stereoscopic content provides additional depth cues for guiding visual attention and understanding our surroundings. This point is further validated by Lang et al. [139] through experimental analysis of the importance of depth cues for eye fixation prediction. Recently, researchers have started to study how to exploit depth cues for salient object detection [122, 123]; these might be captured indirectly from stereo images or directly using a depth camera (e.g., Kinect).
The most straightforward extension is to apply the widely used hypotheses introduced in Sections 2.1.1 and 2.1.2 to the depth channel, e.g., global contrast on the depth map [122, 123]. Furthermore, Niu et al. [122] demonstrate how to leverage domain knowledge in stereoscopic photography to compute the saliency map. The input image is first segmented into regions $\{r_i\}$. In practice, the regions at the focus of attention are often assigned small or zero disparities to minimize the vergence-accommodation conflict. Therefore, the first type of regional saliency based on disparity is defined as
$$s_1(r_i) = \frac{d_{\max} - \bar{d}_i}{d_{\max} - d_{\min}}$$
where $d_{\max}$ and $d_{\min}$ are the maximal and minimal disparities, respectively, and $\bar{d}_i$ denotes the average disparity in region $r_i$. Additionally, objects with negative disparities are perceived as popping out of the scene, so a second type of regional stereo saliency is defined to favor regions with negative average disparity. The final stereo saliency is a linear combination of the two, computed with an adaptive weight.
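A small sketch of the disparity cue under the reconstruction above; the fixed-weight blend stands in for the adaptive weight of Ref. [122] and is purely illustrative.

```python
import numpy as np

def stereo_saliency(disparity, labels, weight=0.5):
    """disparity: (H,W) disparity map; labels: (H,W) region index per pixel."""
    n = labels.max() + 1
    d_bar = np.array([disparity[labels == i].mean() for i in range(n)])
    s1 = (d_bar.max() - d_bar) / (d_bar.max() - d_bar.min() + 1e-12)
    s2 = np.clip(-d_bar, 0, None)              # pop-out: negative disparities
    s2 = s2 / (s2.max() + 1e-12)
    sal = weight * s1 + (1 - weight) * s2      # simplified linear combination
    return sal[labels]                         # back-project scores to pixels
```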
Salient object detection on light fields. The idea of using light fields for saliency detection was proposed in Ref. [107]. A light field, captured using a specially designed camera, e.g., a Lytro, is essentially an array of images shot by a grid of cameras viewing the scene. Light field data offers two benefits for salient object detection: 1) it allows synthesis of a stack of images focused at different depths, and 2) it provides an approximation of scene depth and occlusions.
With this additional information, Li et al. [107] first utilize the focus and objectness priors to robustly choose the background and select foreground candidates. Specifically, the layer with the highest estimated background likelihood score is used to estimate the background regions. Regions, coming from a mean-shift algorithm, with high foreground likelihood scores are chosen as salient object candidates. Finally, the estimated background and foreground are utilized to compute a contrast-based saliency map on the all-focus image.
A new challenging benchmark dataset for light field saliency analysis, known as HFUT-Lytro, was recently introduced in Ref. [140].
2.2 Deep learning based models
All methods reviewed so far use heuristics to detect salient objects. While hand-crafted features allow real-time detection performance, they suffer from several shortcomings that limit their ability to capture salient objects in challenging scenarios.
Convolutional neural networks (CNNs) [69], one of the most popular tools in machine learning, have been applied to many vision problems such as object recognition [141], semantic segmentation [70], and edge detection [142]. Recently, it has been shown that CNNs [44, 47] are also very effective when applied to salient object detection. Thanks to their multi-level and multi-scale features, CNNs are capable of accurately capturing the most salient regions without any prior knowledge (e.g., segment-level information). Furthermore, multi-level features allow CNNs to better locate the boundaries of the detected salient regions, even when shades or reflections exist. By exploiting the strong feature learning ability of CNNs, a series of algorithms has been proposed to learn saliency representations from large amounts of data. These CNN-based models continually improve upon the best results so far on almost all existing datasets, and are becoming the mainstream solution. The rest of this subsection is dedicated to reviewing CNN-based models.
Basically, salient object detection models based on deep learning can be split into two main categories. The first category includes models that use multilayer perceptrons (MLPs) for saliency detection. In these models, the input image is usually over-segmented into single- or multi-scale small regions. Then, a CNN is used to extract high-level features, which are later fed to an MLP to determine the saliency value of each small region. Though high-level features are extracted by CNNs, unlike in fully convolutional networks (FCNs), the spatial information in the CNN features cannot be preserved because of the use of MLPs. To highlight the differences between these methods and FCN-based methods, we call them classic convolutional network based (CCN-based) methods. The second category includes models that are based on fully convolutional networks (FCN-based). The pioneering work of Long et al. [70] falls under this category and aims to solve the semantic segmentation problem. Since salient object detection is inherently a segmentation task, a number of researchers have adopted FCN-based architectures because of their ability to preserve spatial information.
Table 4 shows a list of CNN-based saliency models.
2.2.1 CCN-based models
One-dimensional convolution based methods. As an early attempt, He et al. [44] followed a region-based approach to learn superpixel-wise feature representations. Their approach dramatically reduces the computational cost compared to pixel-wise CNNs, while also taking global context into consideration. However, representing a superpixel with its mean color is not informative enough. Further, the spatial structure of the image is difficult to fully represent using 1D convolution and pooling operations, leading to cluttered predictions, especially when the input image is a complex scene.
Leveraging local and global context. Wang et al. consider both local and global information for better detection of salient regions [160]. To this end, two subnetworks are designed, one each for local estimation and global search. A deep neural network (DNN-L) is first used to learn local patch features to determine the saliency value of each pixel, followed by a refinement operation which captures high-level objectness. For global search, they train another deep neural network (DNN-G) to predict the saliency value of each salient region using a variety of global contrast features, such as geometric information.
The top K candidate regions are utilized to compute the final saliency map using a weighted summation.
In Ref. [46], as in most classic salient object detection methods, both local context and global context are taken into account to construct a multi-context deep learning framework. The input image is first fed to the global-context branch to extract global contrast information. Meanwhile, each image patch, which is a superpixel-centered window, is fed to the local-context branch to capture local information. A binary classifier is finally used to determine the saliency value by minimizing a unified softmax loss between the prediction and the ground-truth label. A task-specific pre-training scheme is adopted to jointly optimize the designed multi-context model.
Lee et al. [144] exploit two subnetworks to encode low-level and high-level features separately. They first extract a number of features for each superpixel and feed them into a subnetwork composed of a stack of convolutional layers with 1×1 kernels. Then, the standard VGGNet [152] is used to capture high-level features. Both low- and high-level features are flattened, concatenated, and finally fed into a two-layer MLP to judge the saliency of each query region.
Bounding box based methods. In Ref. [48], Zou and Komodakis propose a hierarchy-associated rich feature (HARF) extractor. A binary segmentation tree is first built to extract hierarchical image regions and to analyze the relationships between all pairs of regions. Two different methods are then used to compute two kinds of features (HARF1 and HARF2) for regions at the leaf nodes of the binary segmentation tree. They leverage all the intermediate features extracted from the RCNN [161] to capture various characteristics of each image region. With these high-dimensional elementary features, both local regional contrast and border regional contrast are computed for each elementary feature type, to build a more compact representation. Finally, the AdaBoost algorithm is adopted to gradually assemble weak decision trees into a composite strong regressor.
Kim and Pavlovic [145] design a two-branch CNN architecture to obtain coarse and fine representations of coarse-level and fine-level patches, respectively. The selective search method [162] is utilized to generate a number of region candidates that are treated as input to the two-branch CNN. Feeding the concatenation of the feature representations of the two branches into the final fully connected layer allows a coarse continuous map to be predicted. To further refine the coarse prediction map, a hierarchical segmentation method is used to sharpen its boundaries and improve spatial consistency.
In Ref. [146], Wang et al. detect salient objects by employing the Fast R-CNN [161] framework. The input image is first segmented into multi-scale regions using both over-segmentation and edge-preserving methods. For each region, the external bounding box is used and the enclosed region is fed to the Fast R-CNN. A small network composed of multiple fully connected layers is connected to the ROI pooling layer to determine the saliency value of each region. Finally, an edge-based propagation method is used to suppress background regions and make the resulting saliency map more uniform.
Kim and Pavlovic [147] train a CNN to predict the saliency shape of each image patch. The selective search method is first used to localize a stack of image patches, each of which is taken as input to the CNN. After predicting the shape of each patch, an intermediate mask M_I is computed by accumulating the product of the mask of the predicted shape class and the corresponding probability, averaged over all region proposals. To further refine the coarse prediction map, shape class-based saliency detection with hierarchical segmentation (SCSD-HS) is used to incorporate more global information, which is often needed for saliency detection.
Li et al. [149] leverage both high-level features from CNNs and low-level features extracted using hand-crafted methods. To enhance the generalization and learning ability of CNNs, the original R-CNN is redesigned by adding local response normalization (LRN) to the first two layers. The selective search method [162] is utilized to generate a stack of square patches as the input to the network. Both high-level and low-level features are fed to an SVM with L1 hinge loss to help judge the saliency of each square region.
Models with multi-scale inputs. Li and Yu [47] utilize a pre-trained CNN as a feature extractor. Given an input image, they first decompose it into a series of non-overlapping regions and then feed them into a CNN with three different-scale inputs. Three subnetworks are then employed to capture advanced features at different scales. The features obtained from patches at three scales are concatenated and then fed into a small MLP with only two fully connected layers, which acts as a regressor to output a distribution over binary saliency labels. To solve the problem of imperfect over-segmentation, a superpixel-based saliency refinement method is used.
Figure 4 illustrates a number of popular FCN-based architectures, and Table 5 lists the different types of information leveraged by these architectures.
Discussion. As can be seen, MLP-based works rely mostly on segment-level information (e.g., image patches) and classification networks. These image patches are normally resized to a fixed size and then fed into a classification network which determines the saliency of each patch. Some models use multi-scale inputs to extract features at several scales. However, such a learning framework cannot fully leverage high-level semantic information. Further, spatial information cannot be propagated to the last fully connected layers, resulting in a loss of global information.
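To make this generic pipeline concrete, the following is a minimal sketch (not a faithful implementation of any specific model above) of patch-wise saliency scoring with a pre-trained backbone and a small MLP head; all class names, layer sizes, and the frozen/trainable split are illustrative, and torchvision ≥ 0.13 is assumed for the weights argument.

```python
import torch
import torch.nn as nn
import torchvision.models as models

class PatchSaliencyNet(nn.Module):
    # Backbone features of a fixed-size patch are pooled and passed through a
    # small two-layer MLP that outputs one saliency score per patch.
    def __init__(self):
        super().__init__()
        vgg = models.vgg16(weights="IMAGENET1K_V1")
        self.features = vgg.features               # pre-trained conv trunk
        self.pool = nn.AdaptiveAvgPool2d((7, 7))
        self.mlp = nn.Sequential(
            nn.Flatten(),
            nn.Linear(512 * 7 * 7, 256), nn.ReLU(),
            nn.Linear(256, 1),                      # saliency score per patch
        )

    def forward(self, patches):                     # patches: (N, 3, 224, 224)
        return torch.sigmoid(self.mlp(self.pool(self.features(patches))))
```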
Fig. 4 Popular FCN-based architectures. Apart from the classical architecture (a), more and more advanced architectures have been developed recently. Some of them (b-e) exploit skip layers from different scales so as to learn multi-scale and multi-level features. Some (e, g-i) adopt an encoder-decoder structure to better fuse high-level features with low-level ones. Others (f, g, i) introduce side supervision as in Ref. [142] in order to capture more detailed multi-level information. See Table 5 for details of these architectures.
Table 5 Different types of information leveraged by existing FCN-based models. Abbreviations: SP: superpixel, SS: side supervision, RCL: recurrent convolutional layer, PCF: pure CNN feature, IL: instance-level, Arch: architecture
2.2.2 FCN-based models
Unlike MLP-based models that operate at the patch level, fully convolutional networks (FCNs) [70] perform pixel-level operations, overcoming problems caused by fully connected layers such as blurring and inaccurate predictions near the boundaries of salient objects. Due to these desirable properties, a great number of FCN-based salient object detection models have been introduced recently.
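A minimal sketch of the FCN idea is shown below, assuming a VGG-16 trunk from torchvision (≥ 0.13); the 1×1 convolution in place of fully connected layers is what lets the network accept arbitrary input sizes and emit a dense map. Head sizes are illustrative.

```python
import torch.nn as nn
import torchvision.models as models

# The FCN idea in miniature: keep only the convolutional trunk, replace fully
# connected layers with 1x1 convolutions, and upsample back to the input
# resolution so a saliency value is predicted at every pixel.
backbone = models.vgg16(weights="IMAGENET1K_V1").features  # downsamples by 32
head = nn.Sequential(
    nn.Conv2d(512, 256, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(256, 1, kernel_size=1),        # 1x1 conv in place of an FC layer
    nn.Upsample(scale_factor=32, mode="bilinear", align_corners=False),
    nn.Sigmoid(),                            # dense per-pixel saliency in [0, 1]
)
fcn = nn.Sequential(backbone, head)          # accepts arbitrary input sizes
```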
Li and Yu [151] design a CNN with two complementary branches: a pixel-level fully convolutional stream (FCS) and a segment-wise spatial pooling stream (SPS). The FCS introduces a series of skip layers after the last convolutional layer of each stage; the skip layers are fused together as the output of the FCS. Note that a stage of the CNN is composed of all layers with the same resolution. The SPS leverages segment-level information for spatial pooling. Finally, the outputs of the FCS and SPS are fused, followed by a balanced sigmoid cross-entropy loss layer as used in Ref. [142].
Liu and Han [150] propose two subnetworks that produce a prediction map in a coarse-to-fine, global-to-local manner. The first subnetwork can be considered as an encoder whose goal is to generate a coarse global prediction. A refinement subnetwork composed of a series of recurrent convolutional layers is then used to refine the coarse prediction map from coarse scales to fine scales.
In Ref. [155], Tang and Wu consider both region-level saliency estimation and pixel-level saliency prediction. For pixel-level prediction, two side paths are connected to the last two stages of the VGGNet and then concatenated to learn multi-scale features. For region-level estimation, each given image is first over-segmented into multiple superpixels and then the Clarifai model [163] is used to predict the saliency of each superpixel. The original image and the two prediction maps are taken as the inputs to a small CNN to generate a more convincing saliency map as the final output.
Tang et al. [156] take the deeply supervised net [164] and adopt a similar architecture to the holistically-nested edge detector (HED) [142]. Unlike HED, they replace the original convolutional layers in VGGNet with recurrent convolutional layers to learn local, global, and contextual information.
In Ref. [153], Kuen et al. propose a two-stage CNN utilizing spatial transformer and recurrent network units. A convolutional-deconvolutional network is first used to produce an initial coarse saliency map. The spatial transformer network [165] is applied to extract multiple sub-regions from the original images, followed by a series of recurrent network units to progressively refine the predictions for these sub-regions.
Kruthiventi et al. [154] consider both fixation prediction and salient object detection in a unified network. To capture multi-scale semantic information, four inception modules [143] are introduced, connected to the outputs of the 2nd, 4th, 5th, and 6th stages, respectively. These four side paths are concatenated and passed through a small network composed of two convolutional layers to reduce the aliasing effect of upsampling. Finally, the sigmoid cross-entropy loss is used to optimize the model.
Li et al. [157] consider joint semantic segmentation and salient object detection. As in the FCN work [70], the two original fully connected layers in VGGNet [152] are replaced by convolutional layers. To overcome the fuzzy object boundaries caused by the down-sampling operations of CNNs, they use SLIC [166] superpixels to model the topological relationships between superpixels in both spatial and feature dimensions. Finally, graph Laplacian regularized nonlinear regression is used to refine the combination of the predictions from the CNN and the superpixel graph from the coarse level to the fine level.
Zhang et al. [158] detect salient objects using saliency cues extracted by CNNs and a multi-level fusion mechanism. The Deeplab [167] architecture is first used to capture high-level features. To address the problem of large strides in Deeplab, a multi-scale binary pixel labeling method is adopted to improve spatial coherence, as in Ref. [47].
The MSRNet [159] by Li et al. performs both salient object detection and instance-level salient object segmentation. A multi-scale CNN is used to simultaneously detect salient regions and contours. For each scale, features from upper layers are merged with features from lower layers to gradually refine the results. To generate a contour map, the MCG [168] approach is used to extract a small number of candidate bounding boxes and well-segmented regions, which are used to help perform salient object instance segmentation. Finally, a fully connected CRF model [169] is employed to refine spatial coherence.
Hou et al. [49] design a top-down model based on the HED architecture [142]. Instead of connecting independent side paths to the last convolutional layer of each stage, a series of short connections is introduced to build strong relationships between pairs of side paths. As a result, features from upper layers, which carry strong semantic information, are propagated to lower layers, helping to accurately locate the exact positions of salient objects. Meanwhile, rich detailed information from lower layers allows irregular prediction maps from deeper layers to be refined. A special fusion mechanism is exploited to better combine the saliency maps predicted by different side paths.
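The following is a hypothetical sketch of such short connections, in the spirit of (but not identical to) Ref. [49]: every side path also receives the upsampled side outputs of all deeper paths. The channel sizes correspond to VGG stages and, like the class name, are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ShortConnectionFusion(nn.Module):
    # Each stage's features are reduced to a 1-channel side map; shallow side
    # maps are then refined by summing in all deeper (upsampled) side maps.
    def __init__(self, channels=(64, 128, 256, 512, 512)):
        super().__init__()
        self.side = nn.ModuleList([nn.Conv2d(c, 1, kernel_size=1)
                                   for c in channels])

    def forward(self, stage_feats):        # feature maps, shallow -> deep
        sides = [conv(f) for conv, f in zip(self.side, stage_feats)]
        fused = []
        for i, s in enumerate(sides):
            # short connections: add every deeper side map, upsampled to match
            deeper = [F.interpolate(d, size=s.shape[-2:], mode="bilinear",
                                    align_corners=False) for d in sides[i + 1:]]
            fused.append(torch.sigmoid(s + sum(deeper, torch.zeros_like(s))))
        return fused                        # one refined prediction per side path
```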
Discussion. The foregoing approaches are all based on fully convolutional networks, which enable point-to-point learning and end-to-end training strategies. Compared to MLP-based models, these methods make better use of the convolution operation and substantially decrease the time cost. More importantly, recent FCN-based approaches [49, 159] that utilize pure CNN features greatly outperform methods that rely on segment-level information.
To sum up, FCN-based models offer the following three advantages for saliency detection:
1. Local versus global. As mentioned in Section 2.2.1, earlier CNN-based models incorporate both local and global contextual information explicitly (embedded in separate networks [45-47]) or implicitly (using an end-to-end framework). This indeed agrees with the design principles behind many hand-crafted cues reviewed in previous sections. However, FCN-based methods are capable of learning both local and global information internally. Lower layers tend to encode more detailed information such as edges and fine components, while deeper layers favor global and semantically meaningful information. Such properties enable FCN-based networks to drastically outperform classic methods.
2. Pre-training and fine-tuning. The effectiveness of fine-tuning a pre-trained network has been demonstrated in many different applications. The network is typically pre-trained on the ImageNet dataset [170] for image classification. The learned knowledge can be applied to several different target tasks (e.g., object detection [161], object localization [171]) through simple fine-tuning. A similar strategy has been adopted for salient object detection [46, 151] and has resulted in superior performance compared to training from scratch. More importantly, the learned features are able to capture high-level semantic knowledge about object categories, as the employed networks are pre-trained for scene and object classification tasks. (A minimal fine-tuning sketch follows this list.)
3. Versatile architectures. A CNN architecture is formed by a stack of distinct layers that transform the input image into an output map through differentiable functions. The flexibility of FCNs allows designers to devise different structures appropriate for different tasks.
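Referring back to point 2, a minimal sketch of the pre-train/fine-tune recipe is shown below; the choice of which layers to freeze and the single-output head are illustrative, and torchvision ≥ 0.13 is assumed.

```python
import torch.nn as nn
import torchvision.models as models

# Start from ImageNet weights, optionally freeze early layers so their generic
# low-level filters are kept, and retrain a task-specific saliency head.
model = models.vgg16(weights="IMAGENET1K_V1")
for p in model.features[:10].parameters():   # freeze the first conv blocks
    p.requires_grad = False
model.classifier[6] = nn.Linear(4096, 1)     # one saliency score replaces the
                                             # 1000-way ImageNet classifier
```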
Despite their great success, FCN-based models still fail in several cases. Typical examples include scenes with transparent objects, low contrast between foreground and background, and complex backgrounds, as shown in Ref. [49]. This calls for the development of more powerful architectures in the future.
Figure 5 provides a visual comparison of maps generated by classic and CNN-based models.
The value of salient object detection models lies in their application to many areas of computer vision, graphics, and robotics. Salient object detection models have been utilized for several applications such as object detection and recognition [180-186], image and video compression [187, 188], video summarization [189-191], photo collage/media re-targeting/cropping/thumbnailing [174, 192, 193], image quality assessment [194-196], image segmentation [197-200], content-based image retrieval and image collection browsing [177, 201-203], image editing and manipulation [41, 175, 178, 179], visual tracking [204-210], object discovery [211, 212], and human-robot interaction [213, 214]. Figure 6 shows example applications.
Fig. 5 Visual comparisons of the two best classic methods (DRFI and DSR), according to Ref. [132], and two leading CNN-based methods (MDF and DSS).
As more and more models have been proposed in the literature, more datasets have been introduced to further challenge saliency detection models. Early attempts aimed to collect images with salient objects annotated with bounding boxes (e.g., MSRA-A and MSRA-B [25]), while later efforts annotated salient objects with pixel-wise binary masks (e.g., ASD [37] and DUT-OMRON [97]). Typically, images which can be annotated with accurate masks contain few objects (usually one) and simple background regions. In contrast, recent attempts have been made to collect datasets with multiple objects in complex scenes with cluttered backgrounds (e.g., Refs. [22, 23, 26]). As already noted, a more sophisticated mechanism is required to determine the most salient object when several candidate objects are present in the same scene. For example, Borji [23] and Li et al. [22] use the peak of the human fixation map to determine which object is the most salient (i.e., the one that humans look at the most; see Section 1.2).
A list of 22 salient object datasets, comprising 20 image datasets and 2 video datasets, is provided in Table 6. Note that all images or video frames in these datasets are annotated with binary masks or rectangles. Subjects are often asked to label a single salient object in an image (e.g., Ref. [25]) or to annotate the most salient among several candidate objects (e.g., Ref. [26]). Some image datasets also provide, for each image, fixation data collected during a free-viewing task.
Fig. 6 Sample applications of salient object detection.
Table 6 Overview of popular salient object datasets. Above: image datasets; below: video datasets. Obj: objects per image, Ann: annotation, Sbj: subjects/annotators, Eye: eye-tracking subjects, I/V: image/video
Five universally agreed, standard, and easy-to-compute measures for evaluating salient object detection models are described next. For simplicity, we use S to represent the predicted saliency map normalized to [0, 255] and G to represent the ground-truth binary mask of salient objects. For a binary mask, we use |·| to denote the number of non-zero entries in the mask.
4.2.1 Precision-recall (PR)
A saliency map S is first converted to a binary mask M, and then precision and recall are computed by comparing M to the ground truth G:

Precision = |M ∩ G| / |M|,  Recall = |M ∩ G| / |G|
Binarization of S is the key step in this evaluation. There are three popular ways to perform binarization. In the first solution, Achanta et al. [37] propose an image-dependent adaptive threshold for binarizing S, computed as twice the mean saliency of S:

T_a = (2 / (W × H)) Σ_{x=1}^{W} Σ_{y=1}^{H} S(x, y)
where W and H are the width and the height of the saliency map S, respectively.
The second way to binarize S is to use a threshold that varies from 0 to 255. For each threshold, a pair of (precision, recall) scores is computed, and these pairs are used to plot a precision-recall (PR) curve.
The third way to perform binarization is to use a GrabCut-like algorithm (e.g., as in Ref. [84]). Here, the PR curve is first computed and the threshold that leads to 95% recall is selected. With this threshold, an initial binary mask is generated, which is then used to initialize iterative GrabCut segmentation [138] to gradually refine the binary mask.
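A sketch of this third scheme using OpenCV's GrabCut is shown below; the function name and the passed-in threshold are illustrative, and the mask is initialized with only "probably foreground/background" labels before iterative refinement.

```python
import cv2
import numpy as np

def grabcut_binarize(image_bgr, saliency, thresh):
    # Initialize GrabCut from a thresholded saliency map (the threshold would
    # be chosen from the PR curve, e.g., at 95% recall) and refine iteratively.
    mask = np.where(saliency >= thresh, cv2.GC_PR_FGD,
                    cv2.GC_PR_BGD).astype(np.uint8)
    bgd = np.zeros((1, 65), np.float64)      # internal GMM buffers for GrabCut
    fgd = np.zeros((1, 65), np.float64)
    cv2.grabCut(image_bgr, mask, None, bgd, fgd, 5, cv2.GC_INIT_WITH_MASK)
    return np.isin(mask, (cv2.GC_FGD, cv2.GC_PR_FGD)).astype(np.uint8)
```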
4.2.2 F-measure
Often, neither precision nor recall can fully evaluate the quality of a saliency map. Instead, the F-measure is used, defined as the weighted harmonic mean of precision and recall with a non-negative weight β²:

F_β = ((1 + β²) × Precision × Recall) / (β² × Precision + Recall)
In many salient object detection works (e.g., Ref. [37]), β² is set to 0.3 to give greater weight to precision: the recall rate is considered less important than precision (see also Ref. [55]). For instance, 100% recall can be trivially achieved by marking the whole map as foreground.
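The following sketch computes precision, recall, and the F-measure under the adaptive threshold described above; array conventions follow the text (S in [0, 255], G binary), and the guard values in the denominators are an implementation convenience.

```python
import numpy as np

def f_measure(S, G, beta2=0.3):
    # Adaptive binarization at twice the mean saliency, then precision,
    # recall, and their weighted harmonic mean.
    M = S >= 2.0 * S.mean()
    tp = np.logical_and(M, G).sum()
    precision = tp / max(M.sum(), 1)             # |M ∩ G| / |M|
    recall = tp / max(G.sum(), 1)                # |M ∩ G| / |G|
    f = (1 + beta2) * precision * recall / max(beta2 * precision + recall, 1e-8)
    return precision, recall, f
```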
4.2.3 Receiver operating characteristic (ROC) curve
As above, the false positive rate (FPR) and true positive rate (TPR) can be computed by binarizing the saliency map with a set of fixed thresholds:

TPR = |M ∩ G| / (|M ∩ G| + |M̄ ∩ G|),  FPR = |M ∩ Ḡ| / (|M ∩ Ḡ| + |M̄ ∩ Ḡ|)
where M̄ and Ḡ denote the complements of the binary mask M and the ground truth G, respectively. The ROC curve is the plot of TPR versus FPR for all possible thresholds.
4.2.4 Area under ROC curve (AUC)
While the ROC curve is a 2D representation of a model's performance, the AUC distils this information into a single scalar. As the name implies, it is calculated as the area under the ROC curve. A perfect model scores an AUC of 1, while random guessing scores around 0.5.
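A straightforward NumPy sketch of this computation sweeps all 256 thresholds and integrates the resulting curve numerically (the function name is illustrative):

```python
import numpy as np

def roc_auc(S, G):
    # Collect (FPR, TPR) pairs over all thresholds of S (in [0, 255]) against
    # the binary ground truth G, then integrate with the trapezoidal rule.
    G = G.astype(bool)
    pts = []
    for t in range(256):
        M = S >= t
        tpr = np.logical_and(M, G).sum() / max(G.sum(), 1)
        fpr = np.logical_and(M, ~G).sum() / max((~G).sum(), 1)
        pts.append((fpr, tpr))
    fprs, tprs = zip(*sorted(pts))               # order by increasing FPR
    return np.trapz(tprs, fprs)
```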
4.2.5 Mean absolute error (MAE)
The overlap-based evaluation measures introduced above do not consider true negative saliency assignments, i.e., the pixels correctly marked as non-salient.
They favor methods that successfully assign high saliency to salient pixels but fail to detect non-salient regions. Moreover, for some applications [227], the quality of the weighted continuous saliency maps may be of higher interest than that of the binary masks. For a more comprehensive comparison, it is recommended to also evaluate the mean absolute error (MAE) between the continuous saliency map S and the binary ground truth G, both normalized to the range [0, 1]. The MAE score is defined as

MAE = (1 / (W × H)) Σ_{x=1}^{W} Σ_{y=1}^{H} |S(x, y) − G(x, y)|
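In code, the measure is essentially a one-liner (assuming S is stored in [0, 255] as in the preceding metrics):

```python
import numpy as np

def mae(S, G):
    # Mean absolute error between the continuous saliency map and the binary
    # ground truth, both scaled to [0, 1].
    return np.abs(S / 255.0 - G.astype(np.float64)).mean()
```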
Please refer to Ref. [228] for more details on datasets and scores in the field of salient object detection. Code for the evaluation measures is available at http://mmcheng.net/salobjbenchmark.
In the past two decades, hundreds of classic and deep learning based methods have been proposed for detecting and segmenting salient objects in scenes, and a large number of design choices have been explored. Although great success has been achieved recently, there is still large room for improvement. Our detailed method summarization (see Table 1 and Table 2) sends some clear messages about commonly used design choices, and these are valuable for the design of future algorithms, as we now discuss.
5.1.1 Heuristics versus learning from data
Early methods were mainly based on heuristic cues (local or global) to detect salient objects [27, 37, 84, 97]. Recently, saliency models based on learning algorithms have been shown to be very effective (see Table 1 and Table 2). Among these models, deep learning based methods greatly outperform conventional heuristic methods because of their ability to learn large amounts of extrinsic cues from large datasets. Data-driven approaches for salient object detection seem to have surprisingly good generalization ability. An emerging question, however, is whether data-driven ideas for salient object detection conflict with the ease of use of these models. Most learning based approaches are trained on only a small subset of the MSRA5K dataset, and still consistently outperform other methods on all other datasets, which differ considerably. This suggests that it is worth further exploring data-driven salient object detection without losing the advantages of simplicity and ease of use, in particular from an application point of view.
5.1.2 Hand-crafted versus CNN-based features
The first generation of learning-based methods was based on many hand-crafted features. An obvious drawback of these methods is their limited generalizability, especially when applied to complex cluttered scenes. In addition, these methods mainly rely on over-segmentation algorithms, such as SLIC [166], which often yield incomplete salient objects containing only high-contrast components. CNN-based models solve these problems to some degree, even when complex scenes are considered. Because of their ability to learn multi-level features, CNNs can accurately locate salient objects. Low-level features such as edges help sharpen the boundaries of salient objects, while high-level features allow semantic information to be incorporated to identify salient objects.
5.1.3 Recent advances in CNN-based saliency detection
Various CNN-based architectures have been proposed recently. Among these approaches, there are several promising choices that can be further explored in the future. The first concerns models with deep supervision: as shown in Ref. [49], deeply supervised networks strengthen the power of features in different layers. The second choice is the encoder-decoder architecture, which has been adopted in many segmentation-related tasks. Such approaches gradually propagate high-level features back to lower layers, allowing effective fusion of multi-level features. Another choice is to exploit stronger baseline models, for example using very deep ResNets [229] instead of VGGNet [152].
Datasets have been important to the rapid progress in saliency detection. On one hand, they supply large-scale training data and enable performance comparisons of competing algorithms. On the other hand, each dataset is a unique sampling of an unlimited application domain and contains a certain degree of bias.
To date, there seems to be unanimous agreement on the presence of bias (i.e., skew) in the underlying structures of datasets. Consequently, some studies have addressed the effect of bias in image datasets. For instance, Torralba and Efros identify three biases in computer vision datasets, namely selection bias, capture bias, and negative set bias [230]. Selection bias is caused by preference for a particular kind of image during data gathering. It results in qualitatively similar images in a dataset. This is witnessed by the strong color contrast (see Refs. [22, 84]) in the most frequently used salient object benchmark datasets [37]. Thus, two practices in dataset construction are to be preferred: i) having independent image selection and annotation processes [22], and ii) detecting the most salient object first and then segmenting it.
Negative set bias is the consequence of a lack of a rich and unbiased negative set; i.e., one should avoid concentrating only on particular images of interest, and datasets should represent the whole visual world. Negative set bias may also affect the ground truth by incorporating an annotator's personal preferences for some object types. Thus, including a variety of images is encouraged when constructing a good dataset. Capture bias conveys the effect of image composition on the dataset. The most popular kind of such bias is the tendency to compose images with important objects in the central region of the image, i.e., center bias. The existence of bias in a dataset makes quantitative comparisons very challenging and sometimes even misleading. For instance, a trivial saliency model which consists of a Gaussian blob at the image center often scores higher than many fixation prediction models [63, 231, 232].
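For illustration, such a trivial center-prior baseline can be generated in a few lines; the Gaussian width chosen here is arbitrary.

```python
import numpy as np

def center_prior(h, w, sigma_ratio=0.25):
    # The trivial center-bias baseline: an isotropic Gaussian blob at the
    # image center, returned as a saliency map with values in (0, 1].
    ys, xs = np.mgrid[0:h, 0:w]
    d2 = (ys - h / 2.0) ** 2 + (xs - w / 2.0) ** 2
    return np.exp(-d2 / (2.0 * (sigma_ratio * min(h, w)) ** 2))
```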
Several promising research directions for constructing more effective models and benchmarks are discussed here.
5.3.1 Beyond single images
Most benchmarks and saliency models discussed in this study deal with single images. Unfortunately, salient object detection for multiple input images, e.g., salient object detection in video sequences, co-salient object detection, and salient object detection over depth and light-field images, is less explored. One reason is the limited availability of benchmark datasets for these problems. For example, as mentioned in Section 4, there are only two publicly available benchmark datasets for video saliency (mostly comprising cartoons and news). For these videos, only bounding boxes are provided for the key frames to roughly localize salient objects. Multimodal data is becoming increasingly accessible and affordable. Integrating additional cues such as spatio-temporal consistency and depth will be beneficial for efficient salient object detection.
5.3.2 Instance-level salient object detection
Existing saliency models are object-agnostic (i.e., they do not split salient regions into objects). However, humans possess the capability to detect salient objects at the instance level. Instance-level saliency can be useful in several applications, such as image editing and video compression.
Two possible approaches for instance-level saliency detection are as follows. The first uses an object detection or object proposal method, e.g., Fast R-CNN [161], to extract a stack of candidate object bounding boxes and then segments salient objects within them. The second approach, initially proposed in Ref. [159], is to leverage edge information to distinguish different salient objects.
5.3.3 Versatile network architectures
As researchers gain a deeper understanding of CNNs, more and more interesting network architectures are being developed. Using advanced baseline models and network architectures [151] can substantially improve performance. On one hand, deeper networks help better capture salient objects because of their ability to extract high-level semantic information. On the other hand, apart from high-level information, low-level features [49, 159] should also be considered to build high-resolution saliency maps.
5.3.4 Unanswered questions
Some remaining questions include: How many (salient) objects are necessary to represent a scene? Does map smoothing affect scores and model rankings? How is salient object detection different from other fields? What is the best way to tackle center bias in model evaluation? What is the remaining gap between models and humans? Collaborative engagement with other related fields such as saliency for fixation prediction, scene labeling and categorization, semantic segmentation, object detection, and object recognition can help answer these questions, situate the field better, and identify future directions.
In this paper, we have exhaustively reviewed the salient object detection literature with respect to closely related areas. Detecting and segmenting salient objects is very useful. Objects in images automatically capture more attention than background items such as grass, trees, and sky. Therefore, if we can detect salient or important objects first, we can perform detailed reasoning and scene understanding in the next stage. Compared to traditional special-purpose object detectors, saliency models are general, typically fast, and do not need heavy annotation. These properties allow processing of a large number of images at low cost.
Exploring connections between salient object detection and fixation prediction models can help enhance the performance of both types of models. In this regard, datasets that offer both human salient object judgements and eye movements are highly desirable. Conducting behavioral studies to understand how humans perceive and prioritize objects in scenes, and how this concept relates to language, scene description and captioning, visual question answering, attributes, etc., can offer invaluable insights. Further, it is critical to focus more on evaluating and comparing salient object models to gauge future progress. Tackling dataset biases such as center bias and selection bias, and moving towards more challenging images, is important.
Although salient object detection and segmentation methods have made great strides in recent years, a very robust salient object detection algorithm that can generate high-quality results for nearly all images is still lacking. Even for humans, identifying the most salient object in an image is sometimes quite ambiguous. To this end, a general suggestion:
Don't ask what segments can do for you, ask what you can do for the segments (http://www.cs.berkeley.edu/~malik/student-tree-2010.pdf).
- Jitendra Malik
is particularly important when attempting to build robust algorithms. For instance, when dealing with noisy Internet images, although salient object detection and segmentation methods do not guarantee robust performance on individual images, their efficiency and simplicity make it possible to automatically process a large number of images. This allows filtering of images for the purposes of reliability and accuracy, running applications robustly [84, 174, 175, 177, 179, 233], and unsupervised learning [176].