Jizheng Xu, Beijing CN

Patent application number	Description	Published
20080292005	Enhancement layer switching for scalable video coding - An exemplary system includes a data encoder generating a base layer bitstream encoded at a base bit-rate, and a plurality of enhancement layer bitstreams encoded at different enhancement layer bit-rates, and a bitstream selection module selecting one of the enhancement layer bitstreams every video frame based on available channel bandwidth. A method includes transmitting a first enhancement layer bitstream encoded at a first bit-rate, detecting a transition in network bandwidth through a switching bit-rate, and transmitting a second enhancement layer bitstream encoded at a second bit-rate based on the transition in network bandwidth.	11-27-2008
20080310506	Joint Spatio-Temporal Prediction for Video Coding - Systems and methods are described for joint spatio-temporal prediction (JSTP) during video coding. A given block of pixels is simultaneously predicted via inter frame prediction and intra frame prediction. The joint spatio-temporal prediction provides a new JSTP coding mode besides inter and intra modes in coding schemata such as the H.264/AVC video coding standard. In one implementation, subband decomposition obtains a high-pass component of inter frame prediction of a block and a low-pass component of intra frame prediction of the same block. The high-pass component of the inter prediction and the low-pass component of the intra prediction are combined to produce the joint spatio-temporal prediction. In one implementation, a difference between the original block and the JSTP prediction is coded in a bitstream to represent the block.	12-18-2008
20100020882	Barbell Lifting for Wavelet Coding - A method for encoding motion-compensated video data includes generating, for a current frame, a high-pass wavelet coefficient based on a function of pixels in a temporally adjacent frame. The operations are repeated for multiple pixels in an array of pixels in the current frame to form an array of high-pass wavelet coefficients. A low-pass wavelet coefficient is generated based on a function of the high-pass wavelet coefficients. A system for coding video data includes a temporal wavelet decomposition module decomposing a pixel into a high-pass coefficient by performing a discrete wavelet transform on the pixel, a function of pixels in a previous frame, and/or a function of pixels in a subsequent frame. The system includes a motion estimation module generating motion vectors associated with the pixels in the previous frame and in the subsequent frame.	01-28-2010
20120076414	External Image Based Summarization Techniques - Techniques involve visually summarizing documents (e.g., search results, a collection of documents, etc.) using images that are visually representative of those documents. The images representing the documents may be external images obtained from sources other than the documents. The external images may be obtained by performing a separate image-based search using key phrases from the documents rather than extracting the images directly from within the documents themselves. Alternatively, an algorithm may be used to determine an image type, which may be chosen from a selection of external images, thumbnail images, or internal images taken directly from the collection of documents, that is suited to represent each document in the collection of documents. A snippet of the documents may be displayed along with the images which visually represent each of the documents.	03-29-2012
20120189056	Video Coding Redundancy Reduction - Embodiments for reducing redundancy in video coding are disclosed. In accordance with at least one embodiment, video content is represented as a tree structure in which the nodes of the tree structure are associated with attributes of the video content. An encoder analyzes the nodes to determine whether there is redundancy associated with a last node of a divided node. In another embodiment, bidirectional prediction video coding techniques are applied to the video content. The encoder analyzes two lists of reference frames to determine whether a third mode of bidirectional prediction is redundant to a second mode of bidirectional prediction. The efficiency of video coding is improved by refraining from sending data symbols to the decoder in the event that the video coding techniques contain redundancy.	07-26-2012
20120287999	SYNTAX ELEMENT PREDICTION IN ERROR CORRECTION - Architecture that improves error robustness in video coding and decoding. In particular, this can apply to motion vector prediction (MVP) such as a temporal MVP (TMVP). Flags can be used to indicate the use or non-use of a feature, such as to indicate whether the current slice uses or does not use TMVP, and to indicate in the slice header whether list prediction is allowed or not allowed. A flag can be signaled in a sequence parameter set (SPS) or picture parameter set (PPS) as a way to enable an entire sequence to use or not use TMVP. TMVP can also be used to copy all the reference motion information to the current block. To address possible error problems, the full index of the TMVP can be recorded, and temporal information decoding refresh (TIDR) can be inserted into slices periodically.	11-15-2012
20130051452	VIDEO ENCODING ENHANCEMENTS - Techniques for implementing video encoding enhancements may increase video encoding efficiency. One of the techniques may involve the use of an exception value and a maximum index value in an inter-prediction coding lookup table to enhance the combined coding of an inter-prediction direction and a reference frame index for a coding unit. Another of the techniques may include adaptively ordering the encoding of a split flag value and a skip flag value for a coding unit. An additional technique may include providing a uniform approach to adaptive combined coding of the coded block flag (CBF) values for transform or coding units of images that are used with the Residue Quad-tree (RQT) information to represent residue information.	02-28-2013
20130170549	MERGE MODE FOR MOTION INFORMATION PREDICTION - Disclosed herein are representative embodiments of processing digital image data. In one exemplary embodiment disclosed herein, for a current block of a first frame of digital image data, a list of motion vector prediction information for the current block is populated with candidate motion vector prediction data that includes default motion vector prediction data. In another exemplary embodiment disclosed herein, at least a portion of a coded video bitstream is received and a merge flag for a current block in a current frame is decoded. After the merge flag is decoded, at least one merge candidate for the current block is determined.	07-04-2013
20130170550	REPRESENTATIVE MOTION INFORMATION FOR TEMPORAL MOTION PREDICTION IN VIDEO ENCODING AND DECODING - Disclosed herein are representative embodiments of generating representative motion information that can be used during processing of a video frame. In one exemplary embodiment disclosed herein, a reference frame comprising a group of blocks is processed, and motion information for the group of blocks is compressed at least by buffering representative motion-vector information and representative reference-frame index information for the group of blocks. The representative reference-frame index information comprises reference-frame index information of a representative block of the group of blocks, and the representative reference-frame index information represents reference-frame index information for the group of blocks during processing of a current frame.	07-04-2013
20130170556	VARIABLE LENGTH CODING AND DECODING USING COUNTERS - Disclosed herein are representative embodiments for performing entropy coding or decoding using a counter-based scheme. In one exemplary embodiment disclosed herein, a first codeword is received from compressed digital media data. The first codeword is decoded into a first digital media data value by referencing a codeword table that associates the first codeword with the first digital media data value and a second codeword with a second digital media data value. A counter for counting occurrences of the first digital media data value is incremented. The value of the first counter is compared with the value of a second counter that counts occurrences of a second digital media data value. If the value of the first counter and the value of the second counter are equal (or greater than or equal), the codeword table is updated to swap codewords between the first and second digital media values.	07-04-2013
20130343462	Coded-Block-Flag Coding and Derivation - Techniques for coding and deriving (e.g., determining) one or more coded-block-flags associated with video content are described herein. A coded-block-flag of a last node may be determined when coded-block-flags of preceding nodes are determined to be a particular value and when a predetermined condition is satisfied. In some instances, the predetermined condition may be satisfied when log	12-26-2013
20140056347	Non-Transform Coding - Techniques for selectively transforming one or more coding units when coding video content are described herein. The techniques may include determining whether or not to transform a particular coding unit. The determination may be based on a difference in pixel values of the particular coding unit and/or one or more predefined rate-distortion constraints. When it is determined to not perform a transform, the particular coding unit may be coded without transforming the particular coding unit.	02-27-2014
20140105274	Video Coding Redundancy Reduction - Embodiments for reducing redundancy in video coding are disclosed. In accordance with at least one embodiment, video content is represented as a tree structure in which the nodes of the tree structure are associated with attributes of the video content. An encoder analyzes the nodes to determine whether there is redundancy associated with a last node of a divided node. In another embodiment, bidirectional prediction video coding techniques are applied to the video content. The encoder analyzes two lists of reference frames to determine whether a third mode of bidirectional prediction is redundant to a second mode of bidirectional prediction. The efficiency of video coding is improved by refraining from sending data symbols to the decoder in the event that the video coding techniques contain redundancy.	04-17-2014
20140204088	SURFACE CODEC USING REPROJECTION ONTO DEPTH MAPS - A surface reprojection codec and method for surface compression using non-redundant surface projection onto depth maps. A multiple depth map encoder takes a two-dimensional (2D) surface that is a representation of a three-dimensional (3D) object and divides it into a plurality of surface patches. Each of these surface patches is projected onto a depth map from a set of depth maps. This generates a set of converted depth maps. This set of converted depth maps is then encoded using standard encoding techniques. The encoded version of the 3D object may be stored, transmitted over a network, or both. A multiple depth map decoder decodes the set of converted depth maps to obtain the surface patches. These surface patches and connectivity information can be used to regenerate the 2D surface. The 2D surface in turn can be used to reconstruct the 3D object.	07-24-2014
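
The counter-based scheme summarized in application 20130170556 (decode a codeword through a table, increment a counter for the decoded value, then swap codewords when that counter catches up with another value's counter) can be illustrated with a minimal sketch. This is not the patented method itself. The class name `CounterVlcDecoder`, the choice to compare only against the value holding the next-shorter codeword, and the use of the "greater than or equal" variant are all assumptions made for illustration.

```python
class CounterVlcDecoder:
    """Illustrative decoder with a counter-driven, adaptive codeword table.

    Assumption: codewords are listed shortest first, and a value is only
    ever compared with the value that holds the next-shorter codeword.
    """

    def __init__(self, codewords, values):
        self.codewords = list(codewords)  # fixed set of codewords, shortest first
        self.values = list(values)        # values[i] currently owns codewords[i]
        self.counts = {v: 0 for v in values}

    def decode(self, codeword):
        i = self.codewords.index(codeword)
        value = self.values[i]
        self.counts[value] += 1  # count occurrences of the decoded value

        if i > 0:
            neighbor = self.values[i - 1]
            # "Greater than or equal" variant: once this value has been seen
            # at least as often as its neighbor, swap their codewords.
            if self.counts[value] >= self.counts[neighbor]:
                self.values[i - 1], self.values[i] = value, neighbor
        return value


decoder = CounterVlcDecoder(["0", "10", "11"], ["A", "B", "C"])
first = decoder.decode("10")   # "B" is decoded, then promoted to codeword "0"
second = decoder.decode("0")   # codeword "0" now maps to "B"
```

An encoder would have to apply the same counting and swapping rule after each symbol so that both sides keep identical tables; that half is omitted here.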
